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ttfoz-mmwt Lx-mmicmw.~?z>-mm^mz 
ffixzct znm t?% mxm 7 fB«co^:^i?«s 0 

10] *ffl| t ¥f§M14 4: 

«"T 535:* fc»**oaeo L frfc i: <E> fc±IB*l§B 

imxmi 2] ±K8Pmtfr&tt* mti^^y^ 

±m&mxmicis^xWimztirzm®frt>ffife'W-y 
<onm:mm z mmmmxm t , 

gg** t cOWJSM«^I2«$ nfc*K«©r-r ;l/fc 
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[W#«15] ±IHli«3b^ttm^n5^MO^ 10 
■[W««16] ±IBE**JWlg-CHu «r«KiS1*r 

4EttO*^B«^ffi. 

±E*MWiyifc kirmmz ntcmmfr t>mm>v> - v 30 
jjemMBHXUKc <t o ram 2 nfcmjfc* * - 
*fe8-rs««««4ja«yifc, 40 

9E«W#yD^7Ao 50 



mm 2 0 0 3 - 4 4 0 8 0 
4 

©■•sttHj r 5 mmmmm t , 

*S&Si:*nsI?«y KBIIh:llf7*«'Sfc»OWfll^D^ 
7^ES*nfcES«(*. 

/xtiaafflox^^ 6 * s ■s&n-vib % c t t 

[00 0 1] 

i&wiTmmc&Btzuxy mm, Mmc mm* 

[000 2] ~~ 

aU-**«ffiSci#'y molSfflo*y> (Industri 
al Robot) T$>^>tc 0 

[0 0 0 3] ^iST«, APBO/^-h^"- 1 LT4ffi% 
JiiBtefettSAWiSSft^a-rs^fflojP'y howa^ 
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^>$n/c rAHBSLl Xtt rAHB&J Dlvh (Humano 
id Robot) «ON5£f*ftt3$y b& BEtcSlfflfc^no 

[0004] wssBftatfy ntxpictt, rgj tciaa 

JSfiLfciS«Hc«bTiiiflWffla**-rci:fc:J:oT, H 
«M«BfcLTAaL;rdSIH©»«*^Lfc»>* A£L 

[0.00 5] «rfc % W»fr&8»Lfc*J**IMLT:fl: 20 

t4, TOin^-y hJiWKt,^-v^;i/3ytra-* 

[0 0 0 6] fl£*0*J«B»0^ffiTtt, *SI©l8Si: 
©J:5ail«Lfc*ffiOf8«*K«-r5«^tti, fgft 30 

-gtsns*^ Kaa»«ns*\ wre*ftv\i 
[0007] r-usuiu fcv^JWSWicfc**:, rjt 
ffiiiu 3VBSEfflt»*es»?nTv»a»ntf, rata 
jiij m rjtfiiiu 0Uf4\ rjtfijiitt, 

[0008] tfjtrBMWi#ma»fcRrtii2:'r«A«> 

LU (Phonone-like unit) 

[0 0 0 9] LfctfoT, eiRffl^*(c¥ffi%S»-r% 



fffl2003-4 4 0 8 0 
6 ' ~~ 

tin?ftlf£\,\ fc/cU #ISS/^#;I/fcPLUJiJt%ii 
fcn?*3*&fc^$©tt, rjbajiu ^ Tk i t a s h 
i n a g a w a J fcV*3£E& 0J*.tf, 

So 

[0 0 10] %<Dtcibs utfv bmm<D£r>lc*—*"- 
FOJ:34Aa¥a*fll*.T^a^ig£fcH\ §^tL 
T«»bfcJ|l«©^ft*WB«UT5|c*D*BOP L U 
C©*&» tf-^-^x/l/ (ga 
rbage model) *BfflbTBKl/TV*S 0 tf-^-v 7 ^ 
r^l/^ti, 02 0 (a) RtfB2 0 (b) IC^T*} 

fc, ^%a«<DS*«*#tS[fc*s ra*j 

*s©»&o ) *7 , ;n?»*. 
[001 1] Wktomprnfflkm-eit. **-^-> ,; ef 

[0012] fcjsu rssRj t tpluj i: 

14, IJI?|5l8co^igi:LT^fflLTt3D, rpLU?H 

(4, %»© tpluj tfjaKsnsc 

K©5W*3HELfcfc©?a&*. 
[0 0 13] 

ci:^, StMSJK (0<JAtf, »il©/s/») 14, 
SEROSA:*, *J»KIB*ao5«c«»3V*Ht4-3 
[0 0 14] #tC, D#>y hSBlCtfmBmiB^Xffl 

[0015] mwmcs man. r*fcua^t>j %k 
««-&s*&Kov^T^-rfc^ sa«is«»» rh i t 

o tsu na no gaJ^Tj t a s — n a 
ga :j ©ct^tc r#fcLft*«toJ taftlUTCS 

jE»*PLUW##$nTVSc:i:t*5fci6, c© 
#S§*K«f 5 BOWll jWUTT S ^ 5 BHjS*^ o 

[0016] fflibfcAfctt9JOA«»ni;*»«WL 
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[0 0 1 7] *WHiO«***?lcgeiJ|UT« 

PtsmaucHLx rduwincfrsftv^. j ta^tfc 

LT&> Thitotsunanoga 
[0 0 18] Mtc, ccoi-Mctf— ^— ^f^Kio 

/wjFic r<ift!B*a , ra>+tt+i!c'+'e i r+*»j 

[0 0 19] *ct»*«Wtt, tOJ:9ftfle*0llfllfc 

•a-sfij^p y ^ ARtf c ©©jay p ?5 &#aas £ n 

[0 0 2 0] 
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toymmmmmm^-ftvt vxmmnrcmm 
m\z^xmfewmm^w&w-y<mwk.% 

mmg7—7Mzm~3^xtt$L, mzntcwmKi 
<d%mx^<d&4 Kit hxmm<o Lfrtc tft^icwm 

%pmmmmicttMicmm?%imw®^m£Zffix. 

[00 2 1] c<DJ: 5 fcatf-y hSBfi, HttflSll^a 

mum k> <om%ic¥zmmmmT-7/i>icm~3^i5- 

afcfc^TH»£ftfcif^©g*»«fc*Jt«U at 

^<Dhiprcxh% £ Lx^mmmnm^mm^mm 

[00 2 2] CCT-^fc, F?r^#— :/©ffl#&i, 

[0023] ttz, *%micmz$&®wmwiz. mm 
mmm<of%%<DLfrrc£<Dtt£Mm^Ptmmmm 
^xtmznrcgpmmm&m^m^ mmmmm 
<om^£(o^mmmwmm^7 L ~y^£ ltib« 

%m^®£, MU^®fcto^xw.mzntc : gfrt>%js 
*wm?z>mpm®frwi£. mwmzmcto^xmh 

^a££ti§;t3o 

[0 0 2 4] C©J:5***ffia«BElBtt. H^ISIS^a 

mm *) om&icz&mmm^T-fMzm-i^xtt* 
l, m^ntzmmmoom^^^icnLx^ 
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[0 0 2 5] ca*, m&'^-yvmmzs 

[0 0 2 6] Sfe, *8Weflte£*KiWffiW\ 

«u m&i&^p\mzto&Lrc*¥<D&m<DLfrrc 20 

Hi:***. 3. 

[0 0 2 7] COi^ftX^BHWffitintf, Httte 

ttPamMtt^rtm* ntz^p^pwM t mt 30 

'[0 0 2 8] C^T'iRf(C % jffi£/<*-:/©iIflKi, 
[0 0 2 9] mic *KWfC«S*J»ypyv^tt, tt 

B««isf«:j:oTaa«nfcx*^&«ts«nsa»[5i 40 
imztirczmmgy—yMcm-i^xtt* u 

Tifest Lx&mtm#m<mM<D\sfrrct<Dttfim& 
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[0 0 3 0] ZCXftlC. m^^*->©afl«4» 

±a©*J1»rD^i*«:BBSjlS«*:k:ia»L/T««-r«. 
[003 1] 

i%momm<mwa ^m<o-m^mtLx^.tu^ 

c©o#y hSBJi, t*>±&i:f*# 

[0 0 3 2] COD#7hm fi^^£Dffi«B# 
jJWHT'ifcD, rtttttli («»>, JKL#, SO\ sfcb^ 

[0 0 3 3] COn^yhRItt, #tc r^J ^MLfc 

m». mwm. tmsp, 

[0 0 3 4] c©at;ybgl(i, jaHO«iR*B«t'r 
(Charge-Coupled Device) ^^^k^Mir^o 

[0035] *nm<DBMt LXTrstuXy hmm&. 
®®wmmwRVgpwmmmztffiz.xtsr>, ccda 
* 7ic&\ t >ximt!nrcm&frt>f%7£>W'-><om@L% 

te<omVi, HIP, ffis .*W**©to©iii«**^f6ft 

[0 0 3 6] J-XT, *58WO-«fi8«li: LT^-Ta^«y 
[0 0 3 7] *H*0«»'T»tt, P#*y hSS 1 «, 0 
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1 (c*T<fc?fc, r^cj *mLrcBWm^®%^y r- 

M^?n> JfK*«5a--y h 2 0AMSffKS»a- -y h- 4 

[00 38] vY2 £W\ M 2 iCTfit <fc 9 

ic, CPU (Central Processing Unit) 10. DRA 
M (Dynamic Random Access Memory) IK 
ROM (Read Only Memory) 12, PC (Personal Com 10 
puter) ii-^-fy $-7 i-rxiHlSSl 3Rt>WSaS 
SKI 4*W^W1 5*ftLTffiSK:SttSttSCi: 
(c«kf3»fi!t^n/cny hD-;l/gpi 6£, dCDnjjvyh- 
SB 1 ©■lAJBkLTO/^y^y l 7 fctfiRttSttT^ 
5. S§#gi5a-<y h 2fCfi, P#<y r-gS 1 0|r) 

[0 0 3 9] Sfg^x-y h 4 fcte, 
£/c46©C C D (Charge Coupled Device) *^720 

^ ^ffl#A^e»<D nrt-sj ^ rnp< j tvofc^aw 20 

At, U$vh&m\<D Tgj iCfflM^LED (Ligh 
t Emitting Diode) (BaH*"*") ^tfBfJtffiHte^ft* 

nsES^nro^o ccd**72<h;J\ asatx-vh 

4 ©[6] < £[Rifc&sa^f**ffi5£©Hft 5 c i: 

[0 0 4 0] §Pg|5rL->y h3A~3D©H»»$K § 30 
pglU.- -y h 3 A~3 D tmWtife.- -y h 2 £CD!*Sa5 

Slgfla- >y h 4 tfflttgpa.- >y h 2 fc©iHS»#, 
KJSaS3.-yh5fcRS5At<Datt*»Ktt, SE&g 
£fc#cD7??-;xX-2 2 5, ~2 5n R^fyya^ 
-2 2 6, -26, ^n^nERsnTv^. 7*f- 

aI-?2 5, ~2 5„ ti. 0IJ*fcf, -9— 
J«fcLTWl/CV3o ■9--# ; &-^OK»tJ:0, 
a" -y h 3 A~3 Dff*J»SftTB«©8», «^tt» 

[0041] cn^ftjUg-ty-y- 1 8, ftM-tv-fr 1 40 

9, *»yf~feV-9-2 K 2, v^2 3, 

X£-#2 4M##fy->3)(-^2 6, ~26„ m 

<D&m-tyymmc i e DM§7vfai-^ 2 5 , 

~2 5.li, ^tt^ttttfS-t^N^ 7, -27. £ft 
LTn>hn-;l/gpi 6©^SaaiHgSl 4t&ffiZ 

ccDA^7 2os(f^'yfu i 7«, ^n^wt 
[0042] mmumm&i ±iBo^-t>+j-^p. 

cne^n^ftp^M* 1 5%/tLt 50 
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dram 1 1 tonmfcGLmmx&mzo zrzmnm 
sinus 1 4 « x cntfctK^f'U 1 7A^#*j&$n 

cmdrami 1 ^©m^ffiBtcts^-rso 

[00 4 3] CO^^ICLTDR AM 1 1 tc^^nfc 

gMx-*«> C PU 1 0#3«ED#* h»H H&fttt 
»J»*fr3ISk:«ffl*n*. 

[00 4 4] CPUlOti, P#>y r-SB 1 ©WBtflft 
A£ftfcMBB£K:*5^T, 777-yaROMl 2K*&lifl 
^nfeSiJUyp^v^^^WbT, DRAM1 Hcte 
Xtt v C PU 1 0M\ 0 1 CH^C»l^JBi*» 
a - «y h 2 CO P C A - F 7, P -y h fc«#S ftfc¥#fM 

1 3*^LTtt»Hi LTD RAMI 1 KttW-TSo 

[0 0 4 5] C PU 1 Oti, ±iSO* 5 Kfi^ffla®8S 

1 4<tt)DRAMl 1 (cj®^^$n§g-b>"9-r- 

[0 0 4 6] HlC CPU10f±, C©i»JK*S»kDR 

am 1 1 imMLrcM®7°v75i±t\zm-3<fim*tii 

fe?%o CPUlOtt, ^Wfc&fe^&'l^TTZ* 
aX-^2 5, ~25» CDctJ^^^SilTST^^aX 
-^^rSgift-r^ciitc.koT, »-7h4 

Lfc 0 , =&M)g|5ax >y h 3 A TbS 3 D UT*fr 
^^D-fSo Sfe, CPUl Ott, jgSSKiSCT^jB 
r-^*4jSU €^©aisIIESl 4^LT7stf-A2 
4fC^-r5„ $fc, CPUlOti, ±xEC0LEDc0^ 
*T • »fl"%«jT?-r««#££jSU L E D%^*TLfc 0 
m*TL/ct>-t£o 
[0 0 4 7] l/c, CPUlOti. ±atO«t9tP^>y 

o^e>cos^icjsi;TP4-*>y bfcftffis-eso 

[0 0 4 8] cneoa*W«:«fi!ctC<fcoT % P#^y> 
[0 0 4 9] Sic, P#>y hgfi 1 tt, B»Ufc«et 

micmmtztzitxDffi&t lt, /p*a$a-»y h 2 cdp 

yhP-;l/»l 6fc, HflOfJirBMtf 1 0 0%«h*TV^ 
£0 Mt&PKRtf 1 0 0(i, 0 3tc^-TJ;^(c, 

sagp 1 1 0 i:, s^e»as i2ot> 1 3 

0 1, mmmm$.^fm& not, n^it fg^gp 1 

5 0i:^WtTi,^o B^»S«ffli»»i:tt, B4fcjf>-r 
TPLUylJj t^rlB^bfcx-^I/TfeSo C£0S¥#^r 
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[0 0 5 0] uttWfc ttts^asp i los, ^ 2 
§jsic*t"fsis«;&£jfi-rs. wessssi 1 o«> a 

So 

[00 5 1] 3?BM* lZOtt, a-1f©«»**WS 
^S$l 1 OT^ST't^it, W*tf, r*7Fi 

a? 1 2 oa, jy*wic{±, gpmmmmw 121.11 

*r;l> 12 2, WI*t;I/ 12 3, %WMIi& I 2 4m 
frt>&% 0 BStfffiSPl 2 4 TB, BtRfc&B&ttttft 
0»W*««*l*IBIBH|-eff*)ti*. 05*. tf, W&ftfc* 

tfcn&©SEft*3tf«Jilitt*ti*. JB»»5M»fKtt. ft 
B^mftVi (LPC) , ffia7-Uxag» (FFT) , 
/^H/^X7-<;1/^ (BPF) VtfJBl^&tUS. 
[0 0 5 2] £*KNtf 12 0li, filtf^l/ 1 2 2 
mm*T)\> 1 2 3 t^ffl^T, 1 2 4 T*£/£ 

^ftfcLTtt* 09*.tf, inv;l/n7tf;l/ (Hidden M 
arkov Model : WT\ HMMilffifo ) WJBV&tl 

[00 5 3] HMMkli, 

znx^zmm<D%m<DLfrrc£'&m-rz>mmmm$& 
©L^fckovy^^^fcffiffl-rs^ife-e**. hmm 

[0 0 5 4] ^SrtrxMJ, (PLU), * 

ffUtf, b#ih<d<K£ ffej • «Vj • f^j 
• raj • r*>j) • ray • r^j - taj £#{tt£-f 
ftf;i/*fflv^i$, cn6%ffl*-&t>^T^-r5 
cktioT, r«n,\i , , r*3« «t ^ j , 

*B^SB»TH±, SfjRkPLU (Phonone- 
1 Ike unit) i;3&KgiJU*V^{£fflLTV>S 0 £«Sft 

fcSWi, ft (plu) ( P L u t 

[0 0 5 5] HMMtCfcntf, CV&oKffiJ&ZtltzM 
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)Vfr*> rm^j *®j8.?%rc#><D®ntLT, wet* 
hi 2 3 tgpwmmmw 1 2 1 t^isns. s?s 

ggRffi&ftl 2 l£tt, B«Wftka5§#e**<Kf 

•jhtLXKhtzmmx'UbK). nm^Tfr 1 2 3 tit, 
mm t mm t tommvLfrrcommzm l it t, cd-t* 

[0056] j-xTtc^-rfi?ijT'«, r#i§j mmm 

-mmtLxmct&xzzo 
[00 5 7] tit. *mwmvit. mfrfozi tit. 

H*|g©ll***S8SUfct>6*<!S:Xl4*^*^- 
©SkUTfflV\ r5WOU*»fcJ fctt, R*fi*0* 

[0 0 5 8] 0)J*.tf, r~B#fre>, ~^f$T'J kV^X 
%»9«^tOV>T^A5. CCD^, ff, TO (€ 
5) J ri 0,vfc) J • • • F2 4 (KUo^A,) J 
kv^#l§k, f^r (C) J • Tfr'e,} • r*-pj_i:v> 

^iKDfnfmciLt, attf;v 1 2 2*#ia-r 

[0059] r («^%s-r*ss) j , nsj , 
30 r*>p,j % r (a^^S-T¥i§) j , n^j , r$-pj 

[0060] c<DmFmsm&9 1 2 1 tnm^n 1 
2 3k%fflv^THMM^jiffl-r^;:k{cj;or, ri«? 

40 [00 6 1] §^ffi«J6gtefetts*370H-mi, S 
g^rr/l/l 2 ZtctoT^^tiSfSW&xnZi:, 
l-fg^r";!/! 2 3{cJ;oT^A?>n^-SH§W*Xn7k 

[0 0 6 2] Wl§W^Xn7k«, amf?>n 

<@©#i§p^cDg^5i$, ^mmmmcm^x^-^ 

[00 6 3] **3, W^tx-'Wi, Xffi^n^^AEptC 

50 &m*mm%BM?zi>mict>. mmw?* (mmtth 
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[0064] mm. Jft«*art-*iB«ni»» ^ntc< 

*ic r<jft£>+{i+ifi:j o*»«**ffl*LTfe< 
&mmt, p «*s*> I a, ifc i) TfeSo 

[0 0 6 5] 3 0tt, 1 Otf 

■4i«Lfc«W/S*->**IB«!)«ff»cffi» , r'*. 09*. 10 
if, tt&em&i i o# r^%fe^fcigs+ fw>*j 

.j6-rs»^^>«&suc pu i oimzttt* 

4fr6Htfj-f So 

[0 0 6 6] BftftK&HMttl 4 0«, CCD** 
1 4 1 K«-3^T»tS„ £3^*- 

yf-^^-x i 4 nctt, o-6*^, fi^ti-r, m 20 

9J»14 0tt, CCD**5 2 ofrbcoxtimmtx.? 
/^-yf-^^-x 1 4 i fct&wsnT^sB*^* 

s£*?ij£^iif s 0 

[006 7] fS^WIS^^gP 1 5 0 tt, iflffiliffl 

<mws) £»f s Q mtf* rjba 30 

fc^5«*fi45*afe«U P LU^iJT' Tk i t a s h i 
nagawaj t^o%=g<DLfrtc 

[0 0 6 8] ^RR^JUttx-^l/ 1 5 1 fck, 04 fc^ 

LfcT-TVl/T'&So JR14t«> rifc£j , r^fluj , 

[0 0 6 9] BBffl%$#Btt$ i 4 o T«$nrc5: 
^9»Jtfc©r-77Wc^S: nTV^*«*tt, c<Dt—7 40 

Bttx-TVi/ 1 5 Hi, mMI l 2 l fctttfcfc 
icfflStfSo 

[0070] wmnm^mmma^ mmmm^mm 

^©a±©IP^T'±Pg^fe§ (mtf 6^5 5 3 6fg) 
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£&T*tSo 

[00 7 1 ] ^m^-f—ffi 1 5 2 it, me icjjvr «fc 

a fc , ©*M5*<£i£ 2 tilt x-TVl/T' 

&S 0 iB# J f»7;l/75'^-yh J f>#«?Sti:R*«*SE 

[00 7 2] HWMt^r— 7V1/ 1 5 3 tt, 2 ~D(Dt—-? 

[0073] n{ttt(c& miticmtzmmtte, 
r • • • ts 3 j r • • • r • • • *>— r - r • 

SftSo j§ffi©*aaiJi:ii, 0>Rfcf, r 0 a n ;i|pj ©R&£ 
rt&tffr (sun) j t r<*> (p) j fc©*g^64 

SSH"Ji:tt, 3S*»©ffi*ft "e" 
"e" gfttt, a^bav^t>0li:l3©S^*^R# 
■fS^©«H"J^feSo "take" Tx-^J i: 

l^M*<E*%f*#-rsiBK, "a"{c^LTrx-j 
t^5K#«**f*^U "ke" IcjULT, ¥tc r^/j 

[0074] ^(c, fflm&*\cmm#nz&»t&m 

[00 7 5] UCfe^T, *l§e^© 

^f7t±, CT^-tfs n^-y hgH 1 tt» a-^fAWS 
LT*Ke«*- K»c»fT-r*. C©B^K % MttXZ 

[0 0 7 6] Xf-y7S 2tcfcV^T, n^-yhgll 

gHi©ccDA*^2o ©bu (cii^-r s g©^at>* 

ct*«, rjcflyiU fcv^d*»%Wi:LTBlW-r«. ^ 
-Wc£-?~Cim-£tl5*.mt. g|?T'fc<5«T'tP- 
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W-#7J , TkitashinagawaJ 3F©ftn 
[0 0 7 7] ZtV?S 3tcfcl/^T, ntt-yhgll 

4'\it&, A*H^fctt>lc5gKtf*ofc«^tt, 

[0 0 7 8] ttCi&Jc, X^a^O^W^fc^V^TBi 

n#-y bSB l Ic^MtKRfBRV 14 014, 
CCD*^72 0(cfcl/>T®&£ftfciB&{i:H©«fc5& 

:5:^j^#*nT^s*^^>'^->"r-^^-x i 

4 ltS^T, i^BW (OCR : Optical Characto 
r Recognition) f*. CCT% MkJ&kft&BNV 1 
4 0 14, ' *^BS«S*©£»W 1 0 i&fc*^* 

(C*t L/C HtiW I IJ . V> ? BttKS**W5 ftfc»&tt, 20 

rttjyiu taw. 

[0 0 7 9]Ml^T, Xfy^SStfe^T, D^-y h 
S«lfc:fctt53£SHllffi&JS»l 5 0(4, Xf'^S 4 

««oL*»fc (ss##) &£$-r s» 
/<*-y««affl«n«. so 

[0 0 8 0] Xf77"S 6fCfe^T, Dt:7hgll 
14, ±SE©J:5K:±*a«nfcX^5»]KW"rSf8«Ol/^ 
tc (R3WF) jWEIA^S*\ X(4, *fiS©»E#/3©? 

fc (Mftfi) ©#©*£«', r»i^73(4, oo 
TIEU^T*ffro J ©j;9fcHP«g-rs 0 riEt 

[008 1] Sfc, JWObfrft (R*^T) *^»S6iD 

fes^{4, ^n^ntco^r rsi^73t4, oor-r 40 

t>\ J (D&olcUmZo a—W HELiM ■>#> rti 
V\l 3?©JS»fcjgLfclS#£&^LTX7 t v:/S 7K 

[0 0 8 2] a-lfft»6 l"VM,>;tj tSOiS»*»tfcii 

7S 2gU<l47.x>y^S 4<DSQ3f T*M£o 
[0 0 8 3] «±©»8fcJ:oT, JWS^ffiOW*** 
£Lfc&, Xf>y7*S 7Kii&, JCrtiLfc^ylJfci:© 



*oBBWSft%fl6ffl-rs. co**Mfc»jS"r* plu 
pmmicfcwz^zrctbv&m, mar, mpmm^o 

[0 0 8 4] T.x-y 3 Kifcl^T, i-M 

l u5»E?©fg«tt»&«Hfc<fc < sbe-tsc 
[0085] i^Bia^m^ejtt^^n 

73) s. c©«t5tbTf*6nfcttaosis 

©Upfc (S!*73) fc7^*2 3t*^TWILfta- 

ifjb^^^fc^Tv^yyt" ScfcftioT, ±$© 

[0 0 8 6] fcfcfcRBtffcofe**, 
>y7S 8(C*5V>T, D#<y hSB 1 tCfcttSB&tf&S;* 
^BiiS$ 1 4 0 (4, C CDA^72 Otcfc^TtWIlSft 

1 4 0(4, :^KBMS«©(8»*MofcRDfcfcfc^« 
^, ffia©Mffi%^fo 

[0 0 8 7] Stt/^T, Xff^S9ej5^T, o#7h 
SB 1 fcfett*fl«f1t«4<Sa5 1 5 0t4, T.r-y-/S 8 
T*©Bl$S3l£LTt#&nfc&^J(C*fLT, X^iJ© 

«LTfi^©L^fc (tt*^f) «H*#sn*. B^S^tl 

fcX^5>J^«»fe»«^Stf/Xtt 1 OOX?yiJ(c*f L 
T«a©K*^^WIg^il^(C(4, ^T©«e^*-> 

[0 0 8 8] ^(C, Xf77*S 1 0(C*5V^T, X^ylji: 
fS^©L^rc (0E*7tF) fcjb^, -«fWfCfi©ffi»fflas 
•^4^-rSo C£DS?#%J-XT, Sfrl^filfflffiBRfflSF* 
fcEfo CRtf, CCDA^720(c4oT«Kn/c: 

rdtftjllj k^*?«W*t*fHWl 4 0K43 

i/^T, rjt^iij t rtfc^;iu ©23SOfcK»«nfci: 
•rsc e^ttfflffijsiBJ 1 5 0(4, rjtajiu t 

So C (OiI^©frM*l§fflBiSfflS?#^ia 8 \cfjkt 0 
[0 0 8 9] Xf77*S 1 ^ct3^^T, *frM*l§fflBii 
fflS^«^fflv^T, a— »f^6©»^fc:WfC*^BII* 
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r-y^S 1 1 fcfcttS^FKWIfctt, §r&$l§fflBttffl 

i, PLU5»Jj^Sia:S.«^ttg'JO#8lfc*41'. 
[0090] @8T'«\ CC-ta»«nTV^3*IB 

(2.oo rjtaiu ttgijJMi o**»6. a- 
m-tc^T-fcs, «*i:LT, #ais/y!i?;i/fcPLujij 

[0 0 9 l ] «r«*»fflB»ffl»*©1'Jb>6#li^># 

^fcpLujijfc©ffltfi*£Sttfc&, xtv?s 7tc*3 

■5o 

[0 0 9 2] JSUdSLfc*JSfcJ;oT» 0#y hSBi 20 
tt, WWittl 2 1 IC|E1t*tlTV^ftV^*B*«f 

[0 0 9 3] ±$LfcXx-y7S 5£7.-f>y7S 9W 

[0094] xf7ys2iicteK> Httsas 
r-j *»9igue* r^.-j m^tso -&mim 30 

[0095] -73. 7>7-v7s 2 ncfev^T, B$i$as 

[0 0 9 6] **JlJ#*»BI»iltt'f— ^ 1 5 1 40 
S 2 4) „ S/c, maittftJRttr-^H 5 1 {c#fgcD 

[0097] j&mwmwifrm&T-fji isnct 
$nT^£i^*&, xf77"s 2 5k^t» 

ft • #SJS'J>i£> **R*r-7^ l 5 2 KScK 
TBMMR**IRflW-S. 50 
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[0098] m&-®.m • »9ffl[«/jNffittt. *iss!^ 
arc**. w*tf* xtj-j&mff nmwwffli 
n&©«*^t>-e-*>6 rjbp a piiis?Huj awetsct 

[0 0 9 9] #?R#r-7;l/ 1 5 2fcS-3<SI 
T-fe^o l o©**fcttW»©K»<g45«« 

[0 10 0] ig^T> Xf77'S 2 6Cfcl/>T, 

[0101] Xf7/S 22, X-r-y^S 2 4, Xf7 

[0 l 0 2] CCT\ J£8g$g&JSteT-7/l/ 1 5 l CO 
V^TWIBKiBiWr*. e^B«fflS¥» 1 2 1 (c^lg^fr 
MaaUfc^tt-etts sffi^r^l 2 3fcIE®£ftfc»ii 

aiB©8i«aB'j%3iffl"rsci:tt-p*ftv>. w*fcf» 
rjbp D p;iu «$^snffl^ff i 2 i KSiiossufcfc u 

o t«^ss%(c a aitr^saffl-r s c t ^r-^ s 0 
[0103] wi§ ; eT;wc-&stiTv>^i/^{c 

[0104] tvxti*: mm*&%mm*m&icLfc 
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CO l 0 5] Sfg^Wctt, JMg|iS©ggliE¥$BBft 
T 3 frfr 9 tc ?7 7. IHoa««* 5 X t *fg t (DM 

CO l 0 6] *9SS8»H:o^Tt>, £©*5XfcJRT 10 
CO l 0 7] ±i£<0J:3fc*-5fc*toffiflJ*7M/?tt, 

CO 10 8] Lfctf^T. a»BMlcJ:«JMBSflUc43 20 

**. ts*^ *«we»6nfes^i (*«) # 

»ffiBE*Rttr-^H 5 1 ic^Snti/^^> CO 
r-"7;l/0*ltt«*»6^7X*lli*ai#T»#S. &ft> 
0 5 fc jjstffil-Ctt, JSttflit JBtt% 1 o LfrI2S$ LT& 
r<flfii&>. <@W£P3X <1R£>J © 

< r*tJ^<«l*>*»ffl'rSo 30 

co 1 09] **iarctt, -**-foKW-rs«fc!3 

jWa±-r5*£jW53o *£7\ ISHfflS?*© rJWiS/ 

y#;ki W?\ #fgS?*Jgtt7-7/!/ 1 5 i co r^fgj 

co i io] ttionwj-pii, mmmc&vzmfe^ 
±a5tfc«fc^fci* ot*j»j) %rt*©fg 

*B0l»>oll»fi**W4U tt#Stt;fc8aJGI9©R 

CO l l l] c©<fc3fcu BBE/<*«-:/i:l/tX*WW 
fttc t, ©<!r -7. (c MIS L T t frtz Z v 7, * - 

^^vHtiWlhTt*. 50 
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CO l l 2] tc6X\ *nM<DBMtLXjrstzi$v 

•y bSBT'fe^o D#<y h^B 1 (cftltSffltt/a?? 
A©V7h'7x7fllJStt, 0 1 OK^t c 
©SytPTW^Att, ±SiLfc<fc-5tc > W7y^ 
R'OMl 2fc1S*fi£nTftt>: OJp-y t»SHl©*Sfi 

CO 1 1 3] 0 1 OKft^Ts f'WX • K5Y'< • U 
-f* 3 0tt, $W7*a^A©*Tffi«fcffiBU *» 

>y b 3 1 ft»6«U**tlT^* 0 COIH, §f/WX • 
F5-f>*f4, CCDA^720 (02) ^-frU©!! 

«©3>t;a-*-effl^6n*M-K'>x7fciS«7^ 
/ \- F "7 x 7fr ?> ©fU o i&#*s # r«yi*fT ? o 

CO 1 14] $fc» D*f^^-^.^i^ 
b3 2li > f/WX- K5>T/<' Wt3 0©STfi7Jl 

5. -25. H*>x7t7^-fe7.-rS/ci60'l' 

v ?-7 x -< x^nm-r 3 y 7 h 7 x mxftz^-? 

h^xTSfT'^^^y-^^-^-vS 4 4i, fls©a^© 

f/wx- F^^Mrsv? f7x7S¥t*&&t' 

/WX • K^-TA • v*— S^-v 3 5 P#<y FSB 1 
©fili^f It 5 V 7 h ^ x 7S T'& § ff ^ > K • p 
^•y h 3 6 fcfrfcflWiSttTVSo 
CO 1 15] v^-v^> • *zn?x? h 3 7 tt, £7^ 
h • v^-^-v 3 8R0 : -9--tr7, • 3 9 

)!)^M?ntV^o *75*x* h • T*-S^-V 3 8 
tt, PjKx-f >y^ • - h 3 2, 5h> 

• 9x7 • W-\'4 0, RXfT-fyr-i/ay • 

4 lfc#*nS#V7h'7x7»Ofil(l J f»»7*eaT 
§V7h7x7St*fe^ V^-»3 9 

y*U*-K2 8 (02) {cftMZtltc3*?i/3 
y~7 r-r/I/tESStiT^S^^x 7 h F^o^li 
®fcS^V^T^7*v ? x^ h©gJttSBa-rSV7 h -7 
x7®fT*fe5o 

CO 1 1 6] 5 K;l/ • 9x7 • W-^4 0f±> P;Jvr-r 
7* • -9"-^ • *7*^x ^ h 3 2CD±fit«fciuBb, H 

^^-r5V7h'7x7P^P)#lfi!t$nTV^o $ 
fc, 77°U^— >a > • U-f-^4 Hi, 5K/l/-9x7 

• O0±{MNc(MU F;l/- -7x7 • 
W-^4 0*«J«'rS*V7hr7x7»fC*oTJ6a« 

nrcwm&mcm-i^xtiXv vmm. \ wmztitfet 

Sfc»Oy7h'7x7»^6*J«StiT^S. 
CO 1 1 7] 4ft, 5K;l/-9x7- Wt4 0M7 
yj^-v-ay- U-f-\M KDMft4y7h9x7^ 

>&^n^n0 1 i ic7rs? 0 

CO 1 1 8] 5K;l"9x7- U^-¥4 0li, 0 1 It 
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*f g^ajfflx aetttbffi. w&ts&tiim. 
mmm. mtmmm. z&m&m. 

0~5 SMtffcA^-fevyr'r^Xsy^— *tS?a— 
;l/5 9S»*Wr5B»*6 0fc, UJ^tvyf 

m t-v'aVBiffl, CMXMMB. LED 

MTfflRt/^S^fflO&^Wev^-^e 1-6 7 

[0 1 19] Igpf&6 0©*fll<WM J ES?a-;l'5 0~ 10 
5 8(±, D^f-f'^ •■9— /<'*^x*b3 2©^ 
-f-+;l/'B^yh3 3KJ:0DRAMl 1 (02) 

g^>rm£©»&ssL.-<:, sassamsrA^-bTyr 

[0 12 0] AA'bvyf-fWny/^^tyi-;!/ 20 
5 9(±> Cn^i^it^a-^S 0~5 8A^% 

•fflLfcj , rjrc^ttfcj , mp^nfcj > rpsyo 
SfBWiwcAfcj , r»<*f***aL;fcj xtt rp»# 

WfcfctHLfcJ 3F©§ERtfJSH©ttffi J *\ ffiffl#*»6 

ay • u^iM 1 (ctu^f 

[0 12 1] 77U$— S/aV' Hi, 012 

t->'a-yb7 1 „ ^ya-;l/7 2, jRflFe-fJW 3 

[0 12 2] fflfttr^-f^U '7 OfCti, 01 3K 

[0123] ^-lt> cnefTiit^wi, ^n^ftiA 40 

^•fevyx-f £*ny/s— ^t^a- ;l/5 9^&SSKIS 

t*. 

[0 12 4] ftfc, £©£«©««©»&, Stfrtttr 
/Mi, *©fi»*i*jrr«¥ifcfc LT, 014 fc^TJ: 50 
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5ft K CRfiB) NODEo -NOD En fr€> 

te©£f©y- FNODEo -NODE, fCjlg-f Sfrfc 

&/-fnodeo -node, ifCfflzmmtzr-z 
arc 1 -arc. tcWbT*n^nass**ifc»» 

[0 12 5] JlftBty;:, SfrfttfMk ^WfftgB 
OfTft*7*;I/*JPj«1"*y— FNODEo -NOD En 
lC*n^n»B«*T, CtlB/— FNODEo —NO 

d e n nim 1 5 ictr-t * 5 ftttWHia 8 0 lt 

[0 12 6] L©«SIig8 07?(i, ^©/--F-NO 

deo -node, tas^TW&s&ttzxtrfiy 
h (smmso & rA^-r^yF«j mtcmmim 
E^n, ^©»&£mco^T©M&3£HW r-r-^ 

[0 12 7] LrfttfoT; 0 1 5©tt8l»S^8 0T£ 
$nS7-FNODE,oo T'tt, r#-;l/^Hi (B 

all) j j:^ 3 K«M6*#5* enteric, s^b 
z e) j # rofr&ioooj (D&WT'&zct*. mm 

V0*®m (OBSTACLE) J fc^5BMBSJRtf4S. 
*%ST'© r^gfl (D 1 STANCE) J tfi f"0fr> P> 10 

oj ©ttHT?*scfc*^ft©/-Kte3i»-rsfci&©* 

[0 1 2 8] $fc, C©y-FNODE 10 o g 

^n«» « nfc«t»atf 4s«#©/ ^ ^ - * n©5 

^^1/7 3 fcfiWSnfc rSO* (Joy) J > 
rmt (Surprise) J SL<ti mLfr (Sadness) J 

©Mn^©/^^-^ia^ rso*>e>iooj ©seht^s 

fct(ct±flS©/-FK:M^-r5i:t^r-tSJ;-5k:fto 
[0 12 9] £tc, wmwtem OTii, r«i©y-K 

^©s^si^j ©«tfe»t« nwjfe/-Fj ©^jtc^ 

©7— FNODEo -NODE, P>iI&T*£3/- F 
i&j^JEStlT^SfcfcfcK, TA^-r^y F^J , 
r-r-*£j SO* Tx-^WIBHJ ©fftc 823$$ tltc± 
X<D3tftim itch* fc*»T»# S 15 ©&y - F N o 

DEo -node, swwm&ft rtoy- F^N©a 
^st^j ©urto^js-rsffimfc^n^tiHsssn, * 

©y— FNODEo -NODE, KM®? Sf^fcW^f 

^ttT»>^ rffi©y-F^©s^5S^j ©iffltfcits 
rm^ffiftj ©frKgaasnTi^s. ftfc> rfaoy- 

F^©ji^St^J ©fflfcfett**(f ©»*©»« 1 0 0 
[%] tftoTV>5 0 

[0 13 0] LfctfoT, 01 5©«ffi^g8 0T*a 
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tH (BALL) J L, ^Otf-rt'© TS I Z E (At 

4*e>nfcJ§£(c(4, r3o [%] j ©5»i? ry— hn 

0 D E 120 (node 120) J (Cjg&T't, TA 

ct i on i j ©Rii^a^snscfckfts. 

[0131] §t7tt^r;Wi, ^n^nco.fcd&ltfi 
S&S8 0£:LTfE3$2nfc/-FNODEo ~ NO 
DE„ jWaofcR#SJ:3fcUT*fiKSnTfe»>, X* 

tf5*.&nfcfcS3MC, »lSf*y-KNODE. ~N 

o d e „ <Dvmw&%zmm LTW*ttfc*©fr»*ft 

[0 1 3 2] 01 2 (C A * v> a W 1 

fti^fvi^'f'T^'j 7 ocD&ftij^T^wp^n 
ztim-£ft%nm<Do*>. ?#>mibt>tiit9.9iMgL<D 

fc^5, ) %5 K;V - ")i7 • 4 OOHl^y 20 
f^xny/^^^'a-^e 8(CjMttj-fS„ &*5, 
COHWIfcfc^TB, 01 3 {cfct/VCTWcggg 

[0 13 3] £fc frira&A€-^a-;W 1(4, ftW) 
%.7'&lcttit)-k~?yr- -f 5 xnyA— * a— ;l/6 8 

7LftCfc*W- ; a-/l'7 2, J&lftr/W 3&tf 

♦mtTJw 4fcji»]-r5o 

[0 13 4] — 2F, W&5>a-;I/7 2(4, A*J-fcvy 

E[t + l] = E[t] + kexAE[t] 

[0 13 9] &fc, ftB«tS* J fttl*-bW7 L 'f 
y / 2 * V a -;l/ 6 8 6 CDiIftl#&1i »©/ ^ * - 
*ffiOSa»*AE [t] fc£©gg©!B»&#*.5MI: 

ise^*{4 rstfj ©iff»<D/w-*fii©gib*AE 

[t] fc*f?fcK»£4*.S<fc3fcfcoT^ao 
[0 14 0] CCT\ (i)^vyf^X3>M-^t 

1tl8T*&9, 8851**7*;!/ 7 3(4, C©<fc9&1i$fi(c4:o 

Tfe«5«*«fl:«-eSo £n(4, w*tf % ri^^^j t 

V > o fctTE) (i J: 0 & 0 <D&m KMW 5 1 V «t 



[0 13 5] fit, ^W^a-^W 2(4, COmm, 

mwBLTS'ftwn&ttv^-fri i frE>©ii£D(cg-3^ 
t, nwftfc (nt&nfe)'j t^iat^m^m 
wmzi&TZ-e. if^nft (*«>&nfc) j fc*c 

7^77'j 7 oKfettswis-rsftUitr^oitis-rs 

[0 13 6] flKf N SUtr/W 3(4, r«tf (Jo 
y) J , r^gb* (Sadness) J , HSO (Anger) J v 
(Surprise) J , rttS (Disgust) J Rtf r& 
tl (Fear) J <D£fr60<Dliff)(CO^T, £tif®)«(^ 

t, s&itffc-fjw 3(4, <mf)=&ifijo/^^-*<i 

O Wj£OKMUS» i: , «jfii$lffl & tffrlWJft * * S> a - 
[0 13 7] H{*W(c(4, fflHtf/W 3(4, A^l-tv 

suss 2:, ^(Dt^fDu^y h&mi onrnt, mmm 

[t] , mtE<D*(omm(D^ ; 7*-z&*E [t] , z 
<Dti;®(D!g<m*m'tm®.z k . t tr, ( 1 ) st «t o 

T^£Oiim(c43(ti. ; e©tSi)©>'^y-^ffliE [t + 

i] «r»tbu cnzmtEvzommo'^t-zmE 

[t] tKt^Si^tcLT^comWcO/^y-^ffl 
^M«T-T?)o S/c, SM^x;!/? 3(4, cn^|H)1i(cL 

[0 13 8] 

[gu] 

... (1) 

7 2(CfeA^)$nTfet), ^g*i/*a-;l/7 2(4, Z<D 

So 

[0 14 1] ^:*5, frlWS*07-f-F^v*tt» tfltl 

[0 14 2] —77, *fli ; eT;l'7 4(4, raiftS)j (exer 
cise) J , rg1f#: (affection) J , (appetit 
e) J SO" (curiosity) J ©EWc&jlLfc 4 

0©8)5*(COV^T, cn6a*SK:*©a*<03SS*S 
St*UnLT^&o fLT, *^ ; ef f >»l/7 4 

(4, cn&oateRo^v^-*fii*, ^nrnA^-trv 

[0 14 3] ^f*W(c(4, *^*x;l/7 4(4, 



(15) 
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-;l/6 8*^Oii35lSSlcS-5v^THft£©i(0(Sf!: «fe <0M 

asn**©**©*©**©***** i [k] , m 
a©*©»#©^7*-**fci [k] , ^©m*©® 
-g&^-rBftk. 4:lt, msHJWi* (2) «*fflv>r 

#0HJBfcfettS*©»#©/<7*-*ttI [k+l] * 

l[k + l] = l[k] + kixAl[k] 



HI [k] fcK*»*S«fc3fcLT*©»#©^7*-- 
(CUT rft»j *»<*«#©/<5^-*«[*!Bir-r 

So 

[0 14 4] 
[&2] 

... (2) 



"10 1 4 JHMS*StftltfJ-fey:/?^*;*=i 
y ; ^^>*a-^6 8^f> ©si&^tf&akfc©/ * 5 * 
-*ffi©gi«AI [k] fcH<oe«©»»«:#*S^ 

^-^^^a-;l/6 8fr5>©iIftKi, l"$ftj 
-*ffi©gft«AI [k] K*3*Se»*#a.3<fc3fc 

[0 14 6] &*5, ^SWfBt&cfe^Tfi, 4HMK& 

[0 1 4 7] -7?, SFrt/* «>x7 • U-C-Y4 0©Htfj 
■bvyf-f^xny^-^a- /I/6 8fi, 11 lie 

•V4 1 ©fflWHft* ; fc3>a-;w 1 fr<=>#*e>ti£ rtu 
Jtj , rssj , r» < j r h ^ >y * y 9 (#-/i/ 

3R6 9oj*JSf sfi^aati'a-^e i~6 7{c#a 

[0 14 8] *LTCft&fi3*&a*5>a-7l'6 1~6 

7 a, nrn^y f#5*. e> £ ^isfriij 37 > fk 

^2 5, -2 5. (0 2) 3^£1?>-/p}g^fl 
^, *fcf-* 2 4 (02) fr6tH2»-fS^©^/Sx-* 
RtfXtt rgj ©LEDfc4ASOiir-*%«feiaU 
cinp.Of-^^n^f^ >v £ • • ^TVx ? h 

3 2 /l> • P#-y h 3 3&tf{IWHHS8 1 4 

(02) *IP3KftbT#JS-rS7*?-aX-*2 5. ~ 
2 5» X«Xt!-*2 4XttLEDK:JR5Wamj"r*o 

[0 14 9] cotilclTo#» hSIitt, muW^" 

p?7Ak«-^t, sb (rtff) atfHBB © 

5> ©ftjRR tf •* jW* ICJS C fc g 

©fig© L^fc«r«^B«KyiK: «t o T«H©S^ 5B 
3r#7P^I^S!#&$H*?>C£{C«}:oT, 07^ 



[0 1 50] CCDcfcd ftMHTAay^tt, PsffyhS 

tr- (SSffittl) T-rX*, «»*-F) , 3t^S»R75- 
%S©IBg>£,{* (#Rfcf, CD-ROM, MO, CD- 
R, DVD) Wf*.5>nSo IBiS^ftKte, ¥WM 

«ratj*v> 0 ) , i ca-f) m<Dfm®ftt>'5£n 

[0 15 1] cn60*J»^P^7i*tt, »B<£>tti&F 

^^ErU, X(i 1 C A- K«9W)/J>lMb*tifcEil«lt»0 F 

[0 15 2] &*5, ±$LfcHSg©7Ei,©* 

So **SS©®^T'(i, 4S^t«p4-*7 hgBtcML 

[0 15 3] WTfc, *«W©S"JOHflSO««i: LT^ 
■TAIfflSP#yF«B©»IBUcov^TBiWrs. 01 6 
40 RD*0 1 7 Kli, AF^P^-y h^B2 0 OttffiyjRV 
SW©*«*»6l*aLfc«B t *^l/Ti/'»*. Mtc, 01 
8(c(i, c©AraSa#vF8H2 0 0tfJM*rSHSS 

[0 1 5 4] 01 6{c^1-ct^tc, Af»n#7 h£m 
2 00ti, 2*<DM®tffi&2 0 1 %-§S&±Mt* ®9h 

[0 1 5 5] m®2 0 1 fc£8-f SUHSStt, 
-W2 0 2 i:, MKffie>y^«l2 0 3 t, MM@5p-;P 
50 W2 0 4 £1^-5 3gftJg^LTI/>§ 0 
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[0156] $fc % &mmt. mmm^'y^m o 8 

M®5^-y^«l2 1 1 BW3-W2 1 2t. ¥HmW 
tf-y^«l2 1 3 £, ^MB@5p-/N#2 1 4 £, ¥$2 

1 s^-ent^n^ ^a?2 1 msKtt, ?ss* 

©»*S»£B«-£ii&#irafc"efcS. fcJSU ^ 
gP2 1 5©«ffttAPBa!n*v hSB2 0 0®££MI 

[0 15 7] gfc, ##tT yf-«2 0 5 t, 

ft#P-/l/«l2 0 6i:, {*W3-«2 0 7.fc^d3iS 

[0158] tk*«j8*s£*©w»w:, saw 
sn3-t42 1 6t, mmmvvi-m 1 7£, mmmu 

->m2 l 8k, KB«t£'y*ii2 l 9t, SttHSPt? 

2 2 0£, JeMB«n-/l/»|2 2 1 ffig|52 2 
2tt«?n5o *0^a«ffTii, JEB8flS^«yf-«l2 
l 7WMWP-./W2 l 8©SEj(H4, AlfflS!o#yh 

SB 2 0 0 ©KB»ffi«*i£«-r § o Aft©£gP 2 2 2 20 

ABHo^y h«K2 0 0(DEIIEtt, -ifuSS 

jefc-rs. ifttfoT, 6g*jtT«$n 
[0159] BU:*«g-rntf, AHiSptfv vmmz 

OO^fttLTti, ^W-T'3 + 7 x 2 + 3 + 6 x 2 = 3 
^>Hft»OABBaP#y r-£B2 0 0MfU3 2 

ft*«#tMiWKisi:T, s*«-rftt>-BB»a%ict 30 
[0160] ±abfcj:5aAnfflp*v vmm.2 0 0 

Wl>T«IMiiJ«l%fT9ilkfl:HOKW*»6, 7^faX 
[0 1 6 1] 01 9fctt, AlBfflP*^ bSB2 00© 

?ic, AHffln* ? hgH2 o ot±, t hvmMzmm 40 

Uc#ilax7h 2 30, 2 4 0. 2 5 0R/L. 2 
60R/Lt> saian 7 r-BO&IHl&ftfclSSt-f 5 

fc»(!Djfts»j»*tf a w»3.-v h 2 8 o fc-eawte+i 

[0 16 2] AIBl!D*-y h«B2 OO^Ot^ii, 
«»pa-y h 2 8 OfcJ:oTttfiWlcffl»Sft*. MM 
a- >y h 2 8 0&, CPU (Central Processing Uni 
t) ^ Uf fl5±SBB3>*-*^ h (0^L& 
l^) -ettJ8£tlS±Mtm2 8 1 i:, SailslBS-f'AH^ 50 



ptf-y r-SB2 0 OtD^^S^k^r-^^PTVF 

H*frA/?£HiflHtt2 8 2 fcTMftSSnSo COWtla 
x>y h 2 8 0©»H*Bftt, WftlS3£*n*lr\ 0 1 9 
■CttfWHfci-'y h 2 4 Ofcfcttsnt^Stf, SSCa 
-•yh 2 3 0tc»«LTt«fc^. AHfa#y 
MSB 2 0 omfliiJSPa- -y h 2 8 O^SEfif LT, AB 
S p * -y h SB 2 0 0 <D®W t a&SES b < (iittlTS 

[0 16 3] 01 9lC7iiLrcArmvXy h^B2 0 0 

^tCjCoTH^nSo ?%it>*>, SpgUxx-y r-2 30 
tcti, MB8»3-W2 0 2, «»|S^f2 0 3, 1TB 
fi|SD-;l/«2 0 4<D&^£*lI-f 3MB®53-«17^ 
ii-^A! , -^gfl^y3W^ax-*A3 % |f 

[0 16 4] $fc, ®9>XX «y h 2 3 Ofcti, ftgGOtt 
U%ffi&?%rclt><DC C D (ChargeCoupled Device) * 

^ 7 6 nr *\ tu^(cfitB-r -eo 

[0 16 5] $fc, *tSa-7 h 2 4 Ofc-tt, tt^lf 
7fl2 0 5, i*^P-;Hfl2 0 6, i*^3-W2 0 7 

-^A 7 ^ESJnt^S, S/fc, i*^g|5a--yh2 4 
0 ictt, c OAlfflSn 3j?y hSB 2 0 0 ©iSIWBB-t * 
5^fU*«AT^5, CC0/^yx'Jt±, 

[0 16 6] ®EgPOL->yh 2 5 0R/L(i, ±JR 

=L-V hZ51R/Lt, WHfifiX--y h 2 5 2 R/L 

mM3--v h 2 5 3 R/Lfc*B$MI:Snsft<, 
aStT>y^Wl8, Hi8iJP-/i/«l2 0 9, ±®e3-«12 1 
0, WKIpe-y^»2 1 1, mffiB-m2 1 2, 
m¥y?-®2 1 3, ^Bflsa-;HI2 1 4(D&%mi 

fdlT^aX-^A* , ±M3-i7yfaX-lA 
,o , MBfifi^-y^lfiT^^ax-^A.! , WBIPp 
-;l/fS7^5=-ax-^A, 2 , #MBS5t?-yf-Wi7^^ 
ax— ^A. 3 , +^l'M6i5P-;l/«7^f-3.x-^A 

[0 16 7] Sfc, Pg|5rL--yh 2 60R/L{i, AH 
g|5ax<v h 2 6 1 R/Lh, h 2 6 2 R/L 

Egl5ax-y h 2 6 3 R/Ltclffl^b^n?.^ JfiM 
W3-W2 1 6, 8aMgi5e>y^Hll2 1 7, KBS5P— ^ 
«2 1 8, MBM5e<y^llll2 1 9, SmBli5tr-y^lll2 
2 0, £SH«5n-;H*2 2 1 co^^^JS-rSBSBffi 
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H-«l7*?-aX-*Aie , IWtf7fU7^fa 

IfiT^^aX-^Aao , Smi8ii5n-;l/ffi7^f-ax 
fai-n,, As <fcO»*L<tt, ^Tit 
X-ybFtyCfgtgb/c^-l' 7©/^ A C "9— * • 7 * f - a 

x-*T8lj£-fS<:fctfT*£3b 

[0 1 6 8] Egg&axy H2 3CK ft$*gpax -y r- 2 4 10 
0, HWJa^y b 2 5 0, &ISPgl$ax>y h 2 6 O&ifcD 
hfcE, 7^f-aX-^ig»!)$IJffllgP(DilJ$IJ 
®JgP2 3 5, 2 4 5. 2 5 5 R/L, 2 6 5 R/LtfE 
ffi|$nTI/>So BE, §Pg|52 6 0 R, L(DJEJgtf#ft 
Lfc^S*»**a"rS»«S»B-b> , 9-2 9 1 Rtf2 9 2 
fcfc&K^ f*#gpaxy h 2 4 ortfcfi, $ 

[0 16 9] 8«8»K-by9-2 9 lSt>*2 9 21$, 01]* 

&£T«£nS 0 BSHiVVZ 9 3«, W* 20 

co i 7 0] mtmm-tyv29 \ rxsz 9 2 ©mat 

[0171] ±UWffi2 8 1 «U ^•b>'^2 9 1-29 
3 ©a*'tJ6» LT«*B«*^-t 5 >y * fcffliE-f 5 
CfctfT'tSo *0JI(«9fctt, 3 5, 2 4 30 

5, 2 5 5 R/L. 2 6 5 R/L©5*fc»LTjIl£;« 
ftM»*fTV\ AF^a^>yhgS2 0 0«±K, # 

[0 17 2] -AtBMnXy hHB2 0 0©$H*±T*©i: 
SSPaift. ZMP (ZeroMoment Point) «l 

at. tt#jf»k x&ask Harare* £*Kjrrsfci: 

V>K**BI»J»»2 3 5. 2 4 5, 2 5 5R/L, 2 
6 5 R/UC»f ^LT, &4©IiJ3iijaSg|S2 3 40 
5, 2 4 5, • • • HT'ti, ±fflfpa>2 8 1 *^©S« 
avyKfcHMRLT, §7^fai-^A 2 . As • • 

MPj tli, Wfi¥<Oft.W33K&Z> : £-*y r-tf-tfPi: 
%Z>fcffi±<D£,<DCtThr), Sfc, rzMP$ft3tJ i: 
fi, «>RtfAlfflSaoj|?yh«B2 0 0©*fT»flFlWlffl* 

czmp ttm < «u*%**-r So 
[0173] xfimai. nt}t#mmK.ftvT±c 
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®RttmtLxv>m&.t>, mixtit-tyht^yx 
-iHt-^yh^-ifpJ:*^ T&fr-£ tzmp 

(Zero Moment Point) J & 0 

[0 17 4] NAIMhcitfy KDjSaSSWfflJMMri* 

jcK01nIflltt^JS*S. ZMPOII^MICZM 

p **fra #>y h ofisswgijjaiBfcaffl-r sj&ko^ 

Tti\ Miorair Vukobratovicg "LEGGED LOCOMOTION ROB 
OTS" W-IWf W^7hfcAIO£J (B 

[0 17 5] -IRfcti, 4JE*fT«fe0feka-v/WF 
<0 <fc o & 2 JS^tf © D#7h ©?j tf, M/fcffiBtf US < * 
flo, *MrWOZMPSiaiWW'»lf\ LfctfoT, c 
CD«fed«:Kffi^O^b{C#0^l$^»©KSf±, 2£ 

[0 17 6] LX±©£?K, ABSSnffy h^B2 0 0 
It. &*©ffl*J»fflS2 3 5. 2 4 5, • • • 
ffllgi52 8 l<p6©«i=i"T>F*«HRbT, ^7^7^a 

x-^a 2 . A 3 • • - icMLxmmumm^mt} 

SllD#7M12 0 0tt, gW^t^fttli 

[0 17 7] ABBSo^-y h^S2 0 OtcfcltS 
$i)pnx-y h 2 8 OXlt, ±jaSLfcJ:5*§6»W»OB 

9fr$o>%p®nm*mmLx!iimLx^z 0 urn*.- 

>yh2 8 0T*«, B^b^l^^PilJS-try^-. '^+-fa 

[0 17 8] ±SiJP®2 8 1 fi, ±3S©*Hry9-^6« 

»gP2 8 lit, /Vyf»J)!)^«^n5/Wf')Si 
AMfi<Dm£ffig{£te#J-fS 0 DR AMfC^^nfcS 
i$i)®g|52 8 1 *<c©AIB§Ba#y h^B 
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2 0 OC)WlW®W*fto&lC*MZtlZo 
[0 1 7 9] ±Si]®gi5 2 8 1 Its Ad^Ptf-v bg«2 

oo <Dmmtf®xznrcmmm, ym?v>f7h*m.fr 

HlL> cn%DR AMlcftmtZo &tc, ±*J®gP2 8 
1 «\ ±$©<fc ? icifljfflgfl 2 8 1 iDDR AM KM* 

5o Efc, ±«W»2 8 C©!pJWrtt«»tfDRA 

#-yh^B2 0 0fc, v>frf$>§ T#S0J , r^Jg«3J 

[0 18 0] LfctfoT, AlBfiPtfv h^H2 0 0 

KU &©£*&tf«#frtt»c£i;TS»«fc 
R»Tf?5o *fc, ABiao*vHSB2 0 0tt, CC 

2ooow»wt©«i**i«iai±u K^WB^rai 

[0181] 

&HflWB*BW!Jtf*fc LTKBSftfcWBKfliB 
S§r-y;l/i:LTI21i^nfc*mgBIH1t^Si:, « 30 

BH©e*wrr shots nw^gMcfc^T 
sai^gfc fe^rtttH s nfcffi£> - 6 .at 
^rf*#u tt*snfcwrao 

*©«*»«£ >&]t«u «t.fiif^iBB*aiabfc 

[0 18 2] *5SWfc«5DsI?y h«B(i, ttfl^filc 

fci>T$®£nrcH®fr£»m£nfcffi^#— ^©h 



1$M2 003-4 4 0 80 
34 

[0 18 3] LfctfoT, *5SWC«Sa#«y hSBlC 

«fctitf» m^sm (WAtf, wm<D/s/m) % 
-stmrntowwuL BH©itg©»«cj:3AasB© 

[0 18 4] *fc,.*f8^Kff*P#-y h«Btt» #§§ 

© Lfrfc fc 4: lcWM&*1*l&ttX*PWmB&m 

[0 18 5] LfctfoT, *|8WK«SDJl?y hggfc 
J;fttf, AaLftS*fttffl***BJ»fcid5BlfflM, *t 

a— «f «tm£1tffi« A & ft HBSfc JStt««*< A* 
[0 18 6] *9m(Cff«j^mttB», 

^©#>«f left LT^B© Lfrtc t&glctim? Z^P 

[0187] Ltctfix, *ftwfc&zttmm&mic 
«fen«, ttfc sirens aw^tf, mm<D/s/m) z 

^€y«S©Kffi», aB©«l§©IB»»cJ:«A*^*© 

^{t, e^Kra«a©*«ttsfc«k*ji»»* , aiji:«n, 
«HS*R*»a-rsist©B«ffliit*^±-p#s. ctiK 

«kt), IESta^©b^fc*^^iaffl^(cgBlSTt 

^»je^ipj±-T5o 

[0 1 8 8] Sfc, *^(C«S^BKSB», 
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[0 18 9] LfctfoT, *IBBte«S:fc*KBS«fc: 
[0 1 9 0] tit. *frat«**MB»Wfel4, 10 

mmnfcgftzgpzmffi-tzgpmmxmt, mm 

tiicmm <o <D%mtt<D%* \cn lt#i#© l^tc t 

t. ^^n^xmias^r^m^nrc^p^mt 20 
gprnrnxmas^xmrnz nrc^pvmpteB t ttit 

T?feSkLT*ffii:K*IB©«eoU^fci:©J*lSii« 
%E«UfcB^B«ffl»»K:«ra»c82tt"r5E«{|i"J»I 

[0 l 9 l] Lfc^oT,.*«W(c«S*?K«Wj£K: 

«tn«, wc, mm<D/s/m z 

[0 l 9 2] Sfc, *«Wt«SX^KMWffiKJ:n 

:?©««© Lfrfc i: 6 JMIJRtefcJtfSS -eTSJSB 
[0 19 3] LfctfoT, *¥£miC&Z%&m.W5foc 40 

mmm%mm?z±x-&g£rj;?>%.mm&mn*3.- 
vw\ti+ zmm* < ft 9 ? ij^tttf fi±T a 1 1 & 

fc, a— WJRtt««*»l6ftV^tt, JRttttW&A 
[0 19 4] g{C, *%^{C^5»yD^A«, W. 
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BXKiii: * o Tata $ tircttfr hmzz nz>mm 
fimnrc^mm^y—yMcm^xn^b, 

^mm^mmic^xtkjti-gtitc&gptiimt 

[0 19 5] LfctfoT, #§8Wfc«S*J»:/ny5.k 

§§3j©/s/H) *308S©BIB«, JHH©I*E©« 
»fcJ:SA;&Sj|5©£{L *^KIBttmo*««h:J:S 

H*n*r § ^©Bsamig # i"i±-r § . 

[0 19 6] Sfc, ±JS©IW»^n^A%E««{*K 
ie»tTS^-r§ci:(i:J;oT, c ©BS«{**HB&nr 
&T*BlgsBlB¥® MB«*® tSi^a MBttS 

[0ffi©®¥ftUiB^] 

[03] *ftm<D-ffil$.mtLX7jkirutfy b&Siclo 

v%m^pmmm<Dmi&**?mtimx*ib%<> 

[05] *S6WO-fllj«fif<Ji:LT^-rp#'y 

[06] *5§WtD-«fig0Oi:LT^-ro#>y hgB<D^ 
*Sl*f— ^;l/%l«B^-ri>0Tfe§ o 
[07] *«W©-«J«««Ji:LT^-rP#»y hSHtfft 

[08] fcUTTRfPJpy HSS©£t 

^*igfflBiiifflS¥»*iiiB^-r50T'fe5o 
[010] *sm<D-mimt Lt^tu^7 bgs© 

So 
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[0 1 3] *RW©-«tf«lfcLTjjVroji?»y hgfitf) 
AWT/n^i*©^ fti&^x^-i'T? 'J ©*?§/&§: 

m l 4) *«WO-*ia«4: LXmtuXv hSBco 10 
[01 5] *ISfl©H»jaWfcLT^*o#y hggo 

tiai6] *mw<Q~mj&m£L-cmtAmmvtfv h 
[017] *%m<D-®j&mtL-cmA®mv#v b 

[018] *58W©-«l<««li: LX7K*Amtv$v h 

m<DmmMf£*T}i>*mjmic7rstm*&%o 20 
[019] *mi<D-mm\h LXKtAmmvxv v 

[02 0] 02 0 (a) It, r^jRj ^S*#{fii:-r5* 



HfOTt'fc'K 02 0 (b) tt % O&J £K*JIMfitfc 

^■r«^0T'fe5 o 

1 n#yhgf % 2 I«*a5a;=.'yK 3 A, ~3~B, 
3C, 3D pg|5a--yK 4 SWfcx.=.'yK 5 R 
^g]5a->yh, 10 CPU, 11 DRAM, 12 
77'r>aR0M« 13 P C Fl , V?-7x-)'X 

mss, 1 4 ffiwunss, 1 5 1 6 .3 

yha-;i/gp, 1 7 ^.yx'J, 1 8&j§je-fe>"9\ 1 
9 bm&-tyv. 20 ccda^i 21 *-yf- 

-fe>-9\ 2 2 Ha«-fe:/"9\ 2 3 2 4 

— 25, ~25n 7^faX-^i, 26, -26 
„ #f»3^ 27i~27. /n7, 28^ 
*U*-F» 100 ■tt€fBRgR& 110 «SSS 

agp, 1 1 1 nmmwi-r-^^ 1 2 0 mpmsm. 
1 2 1 gpmmm&w. 1 2 2 titf^K 1 2 3 
sig^k 1 2 4 ^g##T^ 1 3 0 mt>±i£ 
9, ho mmm^mm. 1 4 1 
yf-^^-x, 1 5 0 ^fKRMft 151 ¥ 

RRJWPjttr-^k 1 5 2 £3=B!*f— 7;k 1 5 
3 ttfctt^-f-TVk 2 00 AUBSnffy h8B 
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CLAIMS 
[Claim(s)] 

[Claim 1] A storage means for speech recognition by which correspondence _ 
relation with the method of the pronunciation of a word and this word was 
memorized as a dictionary for speech recognition in the robot equipment which 
operates autonomously according to an internal state, A word phonetic storage 
means by which correspondence relation with the phonogram of a word and this 
word was memorized as a word phonetic table, An image pick-up means to 



picturize a photographic subject, and an image recognition means to extract the 
image of a predetermined pattern from the image picturized in the 
above-mentioned image pick-up means, A sound-collecting means to acquire a 
surrounding sound, and a speech recognition means to recognize voice from the 
sound acquired in the above-mentioned sound-collecting means, Two or more 
kinds of phonograms presumed from the above-mentioned predetermined 
pattern extracted in the above-mentioned image recognition means are given 
based on the above-mentioned word phonetic table. A pronunciation information 
generation means to generate the method of pronunciation, and the voice wave 
equivalent to pronunciation to each of two or more kinds of phonograms by 
which grant was carried out [ above-mentioned ], The voice wave of the voice 
recognized in each voice wave and the above-mentioned speech recognition 
means which were generated in the above-mentioned pronunciation information 
generation means is compared. Robot equipment characterized by having a 
storage control means to memorize newly in the above-mentioned dictionary for 
speech recognition noting that it is the method of the pronunciation 
corresponding to the pattern recognition result from which the nearest voice 
wave was extracted in the above-mentioned image recognition means. 
[Claim 2] The image of the above-mentioned predetermined pattern is robot 
equipment according to claim 1 characterized by being the character string 



which consists of an alphabetic character and/or two or more alphabetic 
characters. 

[Claim 3] Robot equipment according to claim 2 characterized by having a 
temporary storage means given to two or more alphabetic characters and these_ 
alphabetic characters which are extracted from the above-mentioned image to 
memorize temporarily correspondence with the method of two or more kinds of 
pronunciation as a dictionary. 

[Claim 4] It is robot equipment according to claim 2 which is equipped with a 
word information storage means by which word information including the 
phonogram and word attribute of a word and this word was memorized as a 
word attribute table, and is characterized by making the above-mentioned word 
attribute correspond and memorizing it in the above-mentioned dictionary for 
speech recognition with the method of the pronunciation of an alphabetic 
character and this alphabetic character which memorizes the above-mentioned 
storage control means newly. 

[Claim 5] It is robot equipment according to claim 4 which is equipped with the 
dialogue management tool which generates the response to the voice 
recognized in the above-mentioned speech recognition means, and is 
characterized by using the above-mentioned word attribute for the _ 
above-mentioned dialogue management tool under the response regulation over 



voice. 

[Claim 6] The above-mentioned speech recognition means is robot equipment 
according to claim 2 characterized by recognizing voice based on a hidden 
Markov model method. 

[Claim 7] A storage means for speech recognition by which correspondence 
relation with the method of the pronunciation of a word and this word was 
memorized as a dictionary for speech recognition, A word phonetic storage 
means by which correspondence relation with the phonogram of a word and this 
word was memorized as a word phonetic table, An image pick-up means to 
picturize a photographic subject, and an image recognition means to extract the 
image of a predetermined pattern from the image picturized in the 
above-mentioned image pick-up means, A sound-collecting means to acquire a 
surrounding sound, and a speech recognition means to recognize voice from the 
sound acquired in the above-mentioned sound-collecting means, Two or more - 
kinds of phonograms presumed from the image of the above-mentioned 
predetermined pattern extracted in the above-mentioned image recognition 
means are given based on the above-mentioned word phonetic table. A 
pronunciation information generation means to generate the method of 
pronunciation, and the voice wave equivalent to pronunciation to each of two or 
more kinds of phonograms by which grant was carried out [ above-mentioned ], 



The voice wave of the voice recognized in each voice wave and the 
above-mentioned speech recognition means which were generated in the 
above-mentioned pronunciation information generation means is compared. The 
character reader characterized by having a storage control means to memorize 
newly in the above-mentioned dictionary for speech recognition noting that it is 
the method of the pronunciation of the alphabetic character which carried out 
[ above-mentioned ] the extract of the nearest voice wave. 
[Claim 8] The image of the above-mentioned predetermined pattern is a 
character reader according to claim 7 characterized by being the character string 
which consists of an alphabetic character and/or two or more alphabetic 
characters. 

[Claim 9] The character reader according to claim 7 characterized by having a 
temporary storage means given to two or more alphabetic characters and these 
alphabetic characters which are extracted from the above-mentioned image to 
memorize temporarily correspondence with the method of two or more kinds of 
pronunciation as a dictionary. 

[Claim 10] It is the character reader according to claim 7 which is equipped with 
a word information storage means by which word information including the 
phonogram and word attribute of a word and this word was memorized as a 
word attribute table, and is characterized by making the above-mentioned word 



attribute correspond and memorizing it in the above-mentioned dictionary for 
speech recognition with the method of the pronunciation of an alphabetic 
character and this alphabetic character which memorizes the above-mentioned — 
storage control means newly. 

[Claim 1 1] It is the character reader according to claim 10 which is equipped with 
the dialogue management tool which generates the response to the voice 
recognized in the above-mentioned speech recognition means, and is 
characterized by using the above-mentioned word attribute for the 
above-mentioned dialogue management tool under the response regulation over 
voice. 

[Claim 12] The above-mentioned speech recognition means is a character 
reader according to claim 7 characterized by recognizing voice based on a 
hidden Markov model method. 

[Claim 13] The image pick-up process which picturizes a photographic subject, 
and the image recognition process which extracts the image of a predetermined 
pattern from the image picturized in the above-mentioned image pick-up process, 
The sound-collecting process which acquires a surrounding sound, and the 
speech recognition process which recognizes voice from the sound acquired in 
the above-mentioned sound-collecting process, Two or more kinds of 
phonograms presumed from the image of the predetermined pattern extracted in 



the above-mentioned image recognition process are given based on the word 
phonetic table on which correspondence relation with the phonogram of a word 
and this word was memorized. The pronunciation information generation 
process which generates the method of pronunciation, and the voice wave 
equivalent to pronunciation to each of two or more kinds of phonograms by — 
which grant was carried out [ above-mentioned ], The voice wave of the voice 
recognized in each voice wave and the above-mentioned speech recognition 
process which were generated in the above-mentioned pronunciation 
information generation process is compared. The character recognition 
approach characterized by equipping the dictionary for speech recognition which 
memorized correspondence relation with the method of the pronunciation of a 
word and this word for being the method of the pronunciation of the alphabetic 
character which carried out [ above-mentioned ] the extract of the nearest voice 
wave with the storage control process memorized newly. 
[Claim 14] The image of the above-mentioned predetermined pattern is the 
character recognition approach according to claim 13 characterized by being the 
character string which consists of an alphabetic character and/or two or more 
alphabetic characters. 

[Claim 15] The character recognition approach according to claim 14 
characterized by having the process which is given to two or more alphabetic - 



characters and these alphabetic characters which are extracted from the 
above-mentioned image, and which is memorized for a temporary storage 
means temporarily by using correspondence with the method of two or more 
kinds of pronunciation as a dictionary. 

[Claim 16] The character recognition approach according to claim 14 
characterized by what a word attribute is made to correspond and is memorized 
in the above-mentioned dictionary for speech recognition with the method of the— 
pronunciation of the alphabetic character memorized newly and this alphabetic 
character at the above-mentioned storage control process. 
[Claim 17] The character recognition approach according to claim 16 which is 
equipped with the dialogue management process which generates the response 
to the voice recognized in the above-mentioned speech recognition process, and 
is characterized by using the above-mentioned word attribute under the 
response regulation over voice at the above-mentioned dialogue management 
process. 

[Claim 18] The character recognition approach according to claim 14 
characterized by carrying out speech recognition based on a hidden Markov 
model method at the above-mentioned speech recognition process. 
[Claim 19] In the control program of the robot equipment which operates 
autonomously according to an internal state The image pick-up processing 



which picturizes a photographic subject, and the image recognition processing 
which extracts the image of a predetermined pattern from the image picturized 
by the above-mentioned image pick-up processing, The sound-collecting 
processing which acquires a surrounding sound, and the speech recognition 
processing which recognizes voice from the sound acquired by the 
above-mentioned sound-collecting processing, Two or more kinds of 
phonograms presumed from the image of the predetermined pattern extracted 
by the above-mentioned image recognition processing are given based on the 
word phonetic table on which correspondence relation with the phonogram of a~~ 
word and this word was memorized. The pronunciation information generation 
processing which generates the method of pronunciation, and the voice wave 
equivalent to pronunciation to each of two or more kinds of phonograms by 
which grant was carried out [ above-mentioned ], The voice wave of the voice 
recognized in each voice wave and the above-mentioned speech recognition 
processing which were generated by the above-mentioned pronunciation 
information generation processing is compared. The control program 
characterized by making robot equipment perform storage processing newly 
memorized in the dictionary for speech recognition which memorized 
correspondence relation with the method of the pronunciation of a word and this 
word for being the method of the pronunciation of the alphabetic character which 



carried out [ above-mentioned ] the extract of the nearest voice wave. 
[Claim 20] The image of the above-mentioned predetermined pattern is a control 
program according to claim 19 characterized by being the character string which 
consists of an alphabetic character and/or two or more alphabetic characters. 
[Claim 21] The image pick-up processing which picturizes a photographic 
. subject, and the image recognition processing which extracts the image of a 
predetermined pattern from the image picturized by the above-mentioned image 
pick-up processing, The sound-collecting processing which acquires a 
surrounding sound, and the speech recognition processing which recognizes 
voice from the sound acquired by the above-mentioned sound-collecting 
processing, Two or more kinds of phonograms presumed from the image of the 
predetermined pattern extracted by the above-mentioned image recognition 
processing are given based on the word phonetic table on which 
correspondence relation with the phonogram of a word and this word was 
memorized. The pronunciation information generation processing which 
generates the method of pronunciation, and the voice wave equivalent to 
pronunciation to each of two or more kinds of phonograms by which grant was 
carried out [ above-mentioned ], The voice wave of the voice recognized in each 
voice wave and the above-mentioned speech recognition processing which were 
generated by the above-mentioned pronunciation information generation 



processing is compared. The record medium with which the control program for 
making robot equipment perform storage processing newly memorized in the 
dictionary for speech recognition which memorized correspondence relation with 
the method of the pronunciation of a word and this word for being the method of 
the pronunciation of the alphabetic character which carried out 
[ above-mentioned ] the extract of the nearest voice wave was recorded. 
[Claim 22] The image of the above-mentioned predetermined pattern is a record 
medium according to claim 21 characterized by being the character string which 
consists of an alphabetic character and/or two or more alphabetic characters. 



DETAILED DESCRIPTION 

[Detailed Description of the Invention] 
[0001] 

[Field of the Invention] The robot equipment, the character reader, and the 
character recognition approach this invention operates autonomously according 
to an internal state, The image of a predetermined pattern is recognized from the 
image especially picturized about the control program and the record medium in 
the list. The robot equipment which matches the acquired voice with this 



recognition image, and registers it newly with this image, The character reader 
and the character recognition approach of matching the voice acquired with the 
image of the picturized predetermined pattern with this recognition image, and 
registering it into a list newly, It is related with the record medium with which the 
control program which performs processing which recognizes the image of a 
predetermined pattern from the acquired image, matches with this recognition 
image the voice acquired with this image, and registers with a list newly, and this 
control program were recorded. 
[0002] 

[Description of the Prior Art] The machinery which performs movement which 
resembled actuation of human being (living thing) using the electric or magnetic 
operation is called "robot." Although it was from the end of the 1960s that a robot 
began to spread in our country, the many were the industrial robots (Industrial 
Robot) in works aiming at automation, full automation, etc. of production, such 
as a manipulator and a carrier robot. _ 
[0003] Recently, a life is supported as human being's partner, namely, 
development of the practical use robot which supports the human activity in 
various scenes on the everyday life of living conditions and others is furthered. 
Unlike the industrial robot, such a practical use robot has human being to whom 
individuality was separately different, or the capacity to learn the adaptation 



approach to various environments oneself, in various aspects of affairs of human 
being's living environment. For example, leg formula mobile robots, such as a 
"pet mold" robot which imitated the body mechanism of the animal of 
quadrapedalism and its actuation like a dog and a cat, "a human mold" designed 
by using as a model the body mechanism of an animal and actuation which 
perform a 2-pair-of-shoes walk in erect posture, or a "human form" robot 
(Humanoid Robot), are already being put in practical use. Since these leg 
formula mobile robots have the appearance configuration possible nearest to the 
appearance of an animal or human being, actuation near actuation of an animal 
and human being can be performed as compared with an industrial robot and" 
various actuation which thought entertainment nature as important further can 
be performed, it may be called an entertainment robot. 

[0004] A thing equipped with the miniature camera equivalent to a "eye", the 
sound-collecting microphone equivalent to a "lug", etc. is also in a leg formula 
mobile robot. In this case, by performing an image processing to the acquired 
image, a leg formula mobile robot can recognize the surrounding environment 
where it inputted as image information, or can recognize "language" from the 
sound of the inputted perimeter. 

[0005] By recognizing the voice especially acquired from the exterior, it changes 
into an alphabetic character or the technique which recognizes voice, has 



answered enough and is carried out is applied to a personal computer and other 
electronic equipment as a voice recognition unit besides the leg formula mobile 
robot. 

[0006] By the technique of the conventional speech recognition, the 
pronunciation and the notation of a word are carrying out speech recognition 
using the dictionary for speech recognition (it is hereafter described as the 
dictionary for recognition.) matched and memorized. Therefore, there was a fault 
that it could not recognize about the word which is not registered into the 
dictionary for recognition. Furthermore, when recognizing the pronunciation of a 
continuous word like a "sentence", you must be the combination of the word 
registered into the dictionary for recognition. That is, when the word which is not 
registered into the dictionary for authentication is contained, it cannot recognize^ 
whether it is incorrect-recognized. 

[0007] If the word "North Shinagawa" is taken for an example and "North 
Shinagawa" is not registered into the dictionary for authentication, pronunciation 
including "North Shinagawa" and "North Shinagawa", for example, the voice 
which consists of continuation of the word "where North Shinagawa is", cannot 
be recognized, or the part of "North Shinagawa" is incorrect-recognized. So, in 
order to enable it to recognize the word which is not registered into the dictionary 
for recognition, it is necessary to newly carry out additional registration of the 



non-registered word. 

[0008] The "PLU train" to which the dictionary for recognition which it has in 
order that a voice recognition unit may make speech recognition possible 
expresses the pronunciation information on the word as the "word symbol" as an 
identifier for distinguishing from other words is matched. PLU(s) (Phonone-like 
unit) are acoustical and a thing used as a phonological unit. The pronounced 
voice can surely be expressed as a combination (PLU train) of PLU. 
[0009] Therefore, what is necessary is just to add a word symbol and the PLU 
train corresponding to this, when registering a word into the dictionary for 
recognition. However, the case where a word symbol and a PLU train can be 
added is restricted when the direct input of the notation "North Shinagawa" and 
"kitashinagawa" can be carried out using input means, such as a keyboard. 
[0010] Therefore, when it does not have an input means like a keyboard like 
robot equipment, there is also a method of carrying out speech recognition of the_ 
pronunciation of the word acquired as voice, and acquiring the PLU train of a 
strange word. In this case, it recognizes with the application of a garbage model 
(garbage model). A garbage model is a model (in however, the case of 
Japanese) which expressed voice as a combination of a "phoneme" used as the 
fundamental unit of pronunciation, and expressed it as a combination of the 
"kana" used as the fundamental unit of the reading of a word, as shown in 



drawing 20 (a) and drawing 20 (b). 

[0011] In the conventional voice recognition unit, by applying a garbage model, 
the recognition result with voice obtained, applied the word symbol to this 
recognition result, these were made to correspond, and it has registered with the 
dictionary for recognition as a new word. 

[0012] However, a "phoneme" and "PLU" are mostly used as a word of 
homonymy, and a "PLU train" writes the pronunciation of the word which 
consisted of that two or more "PLU(s)" was connected here. 
[0013] 

[Problem(s) to be Solved by the Invention] however, by the technique of the 
conventional speech recognition which applied the garbage model A delicate 
difference being in the method of utterance to every user, even if it is the same 
word, and weak phonemes (for example, /s/of the initial of the word etc.) There 
was a fault that becoming that it is hard to be recognized inevitably, change of 
the phoneme under the effect of a surrounding noise, failure in voice section 
detection, etc. became a cause, and recognition precision worsened. 
[0014] Since it is used in many cases under the situation that the distance of the 
microphone for the voice acquisition by the side of a voice recognition unit and a 
user (audio source) is separated when a voice recognition unit is especially 
applied to robot equipment, the frequency of incorrect recognition becomes high. 



[0015] concrete - for example, — n - if the case where cause and ******" is made 
to recognize is shown — a recognition result — "hi to tsu na no ga" and "i tas na 
ga:" — like — "— although it causes and is similar with ******", it may be_ 
recognized as a PLU train which is not the same If speech recognition is 
performed using the dictionary by which word registration was carried out by 
such approach, problems, such as a display error by the fall of recognition 
precision and incorrect recognition, will occur. That is, since the inaccurate PLU 
train would be given, there was a trouble that the precision at the time of 
recognizing this word fell in a new registration word. 

[0016] the case where a person other than the person who registered 
pronounces the same word — temporary — "-- ****** it caused and ******" was 
registered into the dictionary for recognition — from the peculiarity of the 
pronunciation for every user — "— the pronunciation which causes and contains 
the word ******" may not have been recognized 

[0017] Moreover, when changing and displaying the result of speech recognition 
on an alphabetic character, since the information about a display is not given, 
the mistaken alphabetic character may be displayed on a new registration word, 
a "user — "-- the case where it utters to a voice recognition unit, saying "I want ta_ 
go to North Shinagawa" after causing and registering ******" with voice — a voice 
recognition unit - even if it causes and ******" is recognized correctly, a display 



wants to go" to "hitotsunanoga - "-- " - it is one — " — I want to go - " - it may 
become Moreover, also when a voice recognition unit repeats the PLU train of a 
recognition result by speech synthesis, it also produces un-arranging [ of being 
uttered as relation only with the unnatural part of the PLU train of the 
compounded new registration word ]. — 
[0018] Furthermore, the new registration word registered with the garbage model 
in this way cannot register information about the attribute of words, such as a 
part of speech and semantics. For example, even if it registers "North 
Shinagawa", information showing whether this word is a noun or it is the name of 
a place cannot be registered, therefore - temporary - for example, - a dialogue 
— ** — syntax — recognition — ** — language — a model — etc. — "-- < — the name 
of a place - expressing - a word — >-- + -- + - where — + — it is — + -- it is — " — 
like - specification - an expression - a sake - the syntax rule - beforehand - 
recording - having - **** ~ ****** — new — registration - a word — **** — being 
inapplicable - ** - saying - a trouble - it was . Although it could input with voice 
also about the attribute of a word at the time of registration, the user needed to 
know the attribute of a word. Moreover, it is troublesome for a user to input an 
attribute in addition to the register operation of a word. 

[0019] Then, by recognizing the voice which recognizes and acquired the 
alphabetic character from the picturized image as pronunciation of this- 



alphabetic character to the voice which this invention is proposed in view of such 
the conventional actual condition, and is pronounced with the shown alphabetic 
character As opposed to the voice pronounced with the alphabetic character 
which the robot equipment which can recognize the new word which could 
register with the dictionary for recognition by having made the non-registered 
word into the new word, and was registered further with a sufficient precision, 
and a list were shown By recognizing the voice which recognizes and acquired^ 
the alphabetic character from the picturized image as pronunciation of this 
alphabetic character The character reader which can recognize the new word 
which could register with the dictionary for recognition by having made the 
non-registered word into the new word, and was registered with a sufficient 
precision, And by picturizing the shown alphabetic character, recognizing an 
alphabetic character from the picturized image, and recognizing the voice 
pronounced with presentation as pronunciation of the alphabetic character 
acquired and recognized It aims at offering the record medium with which the 
control program which performs processing which registers newly the voice 
which recognizes and acquired the alphabetic character from the picturized 
image in the character recognition approach registered into the dictionary for 
recognition as a new word and the list as pronunciation of this alphabetic 
character, and this control program were recorded. 



[0020] 

[Means for Solving the Problem] In order to attain the purpose mentioned abover 
the robot equipment concerning this invention A storage means for speech 
recognition by which correspondence relation with the method of the 
pronunciation of a word and this word was memorized as a dictionary for speech 
recognition, A word phonetic storage means by which correspondence relation 
with the phonogram of a word and this word was memorized as a word phonetic 
table, An image pick-up means to picturize a photographic subject, and an 
image recognition means to extract the image of a predetermined pattern from 
the image picturized in the image pick-up means, A sound-collecting means to 
acquire a surrounding sound, and a speech recognition means to recognize 
voice from the sound acquired in the sound-collecting means, Two or more kinds 
of phonograms presumed from the image of the predetermined pattern extracted 
in the image recognition means are given based on a word phonetic table. A 
pronunciation information generation means to generate the method of 
pronunciation, and the voice wave equivalent to pronunciation to each of two or 
more kinds of given phonograms, The voice wave of the voice recognized irr 
each voice wave and speech recognition means which were generated in the 
pronunciation information generation means is compared, and the dictionary for 
speech recognition is equipped with a storage control means to memorize newly 



noting that it is the method of the pronunciation of an alphabetic character which 
extracted the nearest voice wave. 

[0021] Such robot equipment gives two or more kinds of phonograms presumed 
from the image of the predetermined pattern extracted in the image recognition- 
means based on a word phonetic table. The method of pronunciation and the 
voice wave equivalent to pronunciation are generated to each of two or more 
kinds of given phonograms. The voice wave of the voice recognized in each 
voice wave and speech recognition means which were generated in the 
pronunciation information generation means is compared, and it memorizes 
newly in the dictionary for speech recognition noting that it is the method of the 
pronunciation corresponding to the image of the predetermined pattern which 
extracted the nearest voice wave. 

[0022] Especially the image of a predetermined pattern is a character string 
which consists of an alphabetic character and/or two or more alphabetic 
characters here. 

[0023] Moreover, a storage means for speech recognition by which, as for the 
character reader concerning this invention, correspondence relation with the 
method of the pronunciation of a word and this word was memorized as a 
dictionary for speech recognition, A word phonetic storage means by which- 
correspondence relation with the phonogram of a word and this word was 



memorized as a word phonetic table, An image pick-up means to picturize a 
photographic subject, and an image recognition means to extract the image of a 
sentence predetermined pattern from the image picturized in the image pick-up 
means, A sound-collecting means to acquire a surrounding sound, and a speech 
recognition means to recognize voice from the sound acquired in the 
sound-collecting means, Two or more kinds of phonograms presumed from the 
image of the predetermined pattern extracted in the image recognition means 
are given based on a word phonetic table. A pronunciation information 
generation means to generate the method of pronunciation, and the voice wave 
equivalent to pronunciation to each of two or more kinds of given phonograms, 
The voice wave of the voice recognized in each voice wave and speech 
recognition means which were generated in the pronunciation information 
generation means is compared, and the dictionary for speech recognition 
equipped with a storage control means to memorize newly noting that it is the 
method of the pronunciation of an alphabetic character which extracted the 
nearest voice wave. 

[0024] Such a character reader gives two or more kinds of phonograms 
presumed from the image of the predetermined pattern extracted in the image 
recognition means based on a word phonetic table. The method of pronunciation 
and the voice wave equivalent to pronunciation are generated to each of two or 



more kinds of given phonograms. The voice wave of the voice recognized in 
each voice wave and speech recognition means which were generated in the 
pronunciation information generation means is compared, and it memorizes 
newly in the dictionary for speech recognition noting that it is the method of the 
pronunciation of an alphabetic character which extracted the nearest voice wave. 
[0025] Especially the image of a predetermined pattern is a character string 
which consists of an alphabetic character and/or two or more alphabetic 
characters here. ~ 
[0026] Moreover, the image pick-up process at which the character recognition 
approach concerning this invention picturizes a photographic subject, The image 
recognition process which extracts the image of a predetermined pattern from 
the image picturized in the image pick-up process, The sound-collecting process 
which acquires a surrounding sound, and the speech recognition process which 
recognizes voice from the sound acquired in the sound-collecting process, Two 
or more kinds of phonograms presumed from the alphabetic character extracted 
in the image recognition process are given based on the word phonetic table on 
which correspondence relation with the phonogram of a word and this word was 
memorized. The pronunciation information generation process which generates 
the method of pronunciation, and the voice wave equivalent to pronunciation to 
each of two or more kinds of given phonograms, The voice wave of the voice 



recognized in each voice wave and speech recognition process which were 
generated in the pronunciation information generation process is compared. The 
dictionary for speech recognition which memorized correspondence relation with 
the method of the pronunciation of a word and this word is equipped with the 
storage control process memorized newly noting that it is the method of the 
pronunciation of an alphabetic character which extracted the nearest voice wave. 
[0027] According to such a character recognition approach, two or more kinds of 
phonograms presumed from the image of the predetermined pattern extracted in 
the image recognition process are given based on a word phonetic table. The 
method of pronunciation and the voice wave equivalent to pronunciation are" 
generated to each of two or more kinds of given phonograms. The voice wave of 
the voice recognized in each voice wave and speech recognition process which 
were generated in the pronunciation information generation process is 
compared, and the dictionary for speech recognition memorizes newly noting 
that it is the method of the pronunciation of the alphabetic character which the 
nearest voice wave extracted. 

[0028] Especially the image of a predetermined pattern is a character string 
which consists of an alphabetic character and/or two or more alphabetic 
characters here. 

[0029] Furthermore, the image pick-up processing whose control program 



concerning this invention picturizes a photographic subject, The image 
recognition processing which extracts the image of a predetermined pattern from 
the image picturized by image pick-up processing, The sound-collecting 
processing which acquires a surrounding sound, and the speech recognition 
processing which recognizes voice from the sound acquired by sound-collecting 
processing, Two or more kinds of phonograms presumed from the alphabetic 
character extracted by image recognition processing are given based on the 
word phonetic table on which correspondence relation with the phonogram of a 
word and this word was memorized. The pronunciation information generation 
processing which generates the method of pronunciation, and the voice wave 
equivalent to pronunciation to each of two or more kinds of given phonograms, 
The voice wave of the voice recognized in each voice wave and speech^ 
recognition processing which were generated by pronunciation information 
generation processing is compared. Robot equipment is made to perform 
storage processing newly memorized in the dictionary for speech recognition 
which memorized correspondence relation with the method of the pronunciation 
of a word and this word noting that it is the method of the pronunciation of an 
alphabetic character which extracted the nearest voice wave. 
[0030] Especially the image of a predetermined pattern is a character string 
which consists of an alphabetic character and/or two or more alphabetic 



characters here. Moreover, an above-mentioned control program is recorded on 

a record medium, and is offered. 

[0031] 

[Embodiment of the Invention] The robot equipment shown as an example of 1 
configuration of this invention is robot equipment which carries out autonomous 
working according to an internal state. This robot equipment is a leg formula 
mobile robot which has an upper extremity, the truncus section, and the 
membrum inferius at least, and makes a migration means only an upper 
extremity and the membrum inferius, or the membrum inferius. Although there is 
robot equipment imitating the pet mold robot imitating the body mechanism of 
the animal of quadrapedalism or its motion, the body mechanism of the animal of 
the 2-pair-of-shoes walk which uses only the membrum inferius as a migration 
means, or its motion in a leg formula mobile robot, the robot equipment shown 
as a gestalt of this operation is a quadrapedalism type leg formula mobile robot. 
[0032] This robot equipment can act according to internal states (getting angry 
sadness, joy, pleasure, etc.), and also is a practical use robot which supports the 
human activity in various scenes on the everyday life of living conditions and 
others, and is the entertainment robot which can express the fundamental 
actuation which the animal of quadrapedalism performs. 

[0033] Especially this robot equipment is the form which imitated the "dog", and 



has a head, idiosoma, the upper extremity section, the membrum-inferius- 
section, the tail section, etc. The part equivalent to the joining segment and joint 
of each part is equipped with the actuator and potentiometer of the number 
according to the degree of freedom of movement, and control of a control 
section can express target actuation. 

[0034] This robot equipment is equipped with the various sensors for detecting 
the operation received from the image pick-up section for acquiring a 
surrounding situation as image data, the microphone section which acquires 
surrounding voice, and the exterior etc. As the image pick-up section, a small 
CCD (Charge-Coupled Device) camera is used. 

[0035] The robot equipment shown as a gestalt of this operation is equipped with 
image recognition equipment and a voice recognition unit, and generates the 
voice wave which gives two or more kinds of reading kanas which extract the 
image of a predetermined pattern from the image picturized in the CCD camera, 
and are presumed from the image of the extracted predetermined pattern, and is 
equivalent to each of two or more kinds of given reading kanas. As a- 
predetermined pattern of an image here, the image of an alphabetic character 
(character string), an objective configuration, a profile, a shank, and the body 
itself etc. is raised. And it is robot equipment newly memorizable in the dictionary 
for speech recognition noting that it is the method (reading) of the pronunciation 



corresponding to the image of the predetermined pattern which compared this 
voice wave with the voice wave of the voice acquired in the microphone section, 
and extracted the nearest voice wave. 

[0036] Hereafter, the robot equipment shown as an example of 1 configuration of 
this invention is explained with reference to a drawing. The following explanation 
explains the case where the predetermined pattern recognized from the 
acquired image is an alphabetic character (character string) to a detail. 
[0037] With the gestalt of this operation, robot equipment 1 is the so-called pet 
mold robot of the configuration which imitated the "dog", as shown in drawing 1. 
The leg units 3A, 3B, and 3C and 3D are connected with front and rear, right and 
left of the idiosoma unit 2, the head unit 4 is connected with the front end section 
of the idiosoma unit 2, the tail section unit 5 is connected and robot equipment 1 
is constituted by the back end section. 

[0038] As shown in drawing 2 , the control section 16 formed by connecting CPU 
(Central Processing Unit)10, DRAM (Dynamic Random Access Memory)11, a 
flash ROM (Read Only Memory) 12, PC (Personal Computer) card interface 
circuitry 13, and a digital disposal circuit 14 mutually through an internal bus 15 
and the dc-battery 17 as a source of power of this robot equipment 1 are 
contained by the idiosoma unit 2. Moreover, the angular-velocity sensor 18 and 
acceleration sensor 19 for detecting the sense of robot equipment 1 and the 



acceleration of a motion are contained by the idiosoma unit 2. 
[0039] The CCD (Charge Coupled Device) camera 20 for picturizing an external 
situation to the head unit 4, The touch sensor 21 for detecting the pressure 
received by "it strokes" and the physical influence of "striking" from a user, The 
distance robot 22 for measuring the distance to the body located ahead, LED_ 
(Light Emitting Diode) (not shown) equivalent to the microphone 23 for collecting 
alien frequencies, the loudspeaker 24 for outputting voice, such as a cry, and the 
"eye" of robot equipment 1 etc. is arranged in the predetermined location, 
respectively. CCD camera 20 can picturize the photographic subject which the 
head unit 4 tends to turn to with a predetermined field angle. 
[0040] Actuators 251 -25n and Potentiometers 261 -26n for free frequency are 
arranged by the joint part of each leg unit 3A-3D, the joining segment of each leg 
unit 3A-3D and the idiosoma unit 2, the joining segment of the head unit 4 and 
the idiosoma unit 2, and the joining segment of the tail section unit 5 and tail 5A, 
respectively. Actuators 251 -25n have the servo motor as a configuration. Leg 
unit 3A - 3D are controlled by the drive of a servo motor, and it changes in a 
target posture or actuation. 

[0041] LED and each actuators 251 -25n are connected with the digital disposal 
circuit 14 of the control section 16 through the hubs 271 -27n corresponding to 
various sensor lists, such as these angular-velocity sensor 18, an acceleration^ 



sensor 19, a touch sensor 21, a distance robot 22, a microphone 23, a 
loudspeaker 24, and each potentiometers 261 -26n, respectively, and direct 
continuation of CCD camera 20 and the dc-battery 17 is carried out to the digital 
disposal circuit 14, respectively. 

[0042] A digital disposal circuit 14 incorporates sensor data, and the image data 
and voice data which are supplied from each above-mentioned sensor one by 
one, and carries out sequential storing of these through an internal bus 15 in the 
predetermined location in DRAM11, respectively. Moreover, a digital disposal 
circuit 14 incorporates the dc-battery residue data showing the dc-battery 
residue supplied from a dc-battery 17 with this one by one, and stores this in the 
predetermined location in DRAM11. 

[0043] Thus, each sensor data stored in DRAM11, image data, voice data, and 
dc-battery residue data are used in case CPU 10 performs motion control of the 
robot equipment 1 concerned. 

[0044] CPU 10 reads the control program stored in the flash ROM 12 at the time- 
of the first stage when the power source of robot equipment 1 was switched on, 
and stores it in DRAM11. Or CPU10 reads the semiconductor memory 
equipment with which the PC Card slot of the idiosoma unit 2 which is not 
illustrated to drawing 1 was equipped, for example, the control program stored in 
the so-called memory card 28, through the PC card interface circuitry 13, and 



stores it in DRAM11. 

[0045] CPU 10 judges [ as mentioned above ] the situation of self and a perimeter, 
and the existence of the directions from a user, and influence based on each 
sensor data by which sequential storing is carried out, image data, voice data, 
and dc-battery residue data in DRAM11 from the digital disposal circuit 14. 
[0046] Furthermore, CPU 10 opts for the action based on this decision result and 
the control program stored in DRAM11. By driving the actuator needed out of 
Actuators 251 -25n based on the decision result concerned, CPU 10 moves the 
head unit 4 vertically and horizontally, moves the tail of the tail section unit 5, or_ 
drives and walks him around each leg unit 3A thru/or 3D. Moreover, CPU 10 
generates voice data if needed, and supplies it to a loudspeaker 24 through a 
digital disposal circuit 14. Moreover, CPU10 generates the signal which directs 
lighting and putting out lights of above-mentioned LED, and LED is turned on or 
it switches it off. 

[0047] Moreover, CPU 10 operates a robot according to the demand from the 
dialogue Management Department 110 grade which a robot is controlled 
autonomously as mentioned above, and also is mentioned later. 
[0048] By these fundamental configurations, robot equipment 1 acts 
autonomously according to the situation of self and a perimeter, the directions 
from a user, and influence. 



[0049] Furthermore, robot equipment 1 equips the control section 16 of the 
idiosoma unit 2 with the image speech recognition section 100 as a configuration 
for registering with the dictionary for speech recognition by making 
correspondence with the alphabetic character recognized to be the recognized- 
pronunciation into a new registration word. The image speech recognition 
section 100 has the dialogue Management Department 110, the speech 
recognition section 120, the output generation section 130, the 
image-processing character recognition section 140, and the pronunciation 
information generation section 150, as shown in drawing 3 . As it is indicated in 
drawing 4 as the dictionary for speech recognition, it is the table which recorded 
the "PLU train" which expresses the pronunciation information corresponding to 
this word as the "word symbol" as an identifier for distinguishing from other 
words. By referring to this dictionary, the notation of the method (reading) of the 
pronunciation of a word or the word corresponding to pronunciation can be 
extracted. 

[0050] Concretely, the dialogue Management Department 110 generates the 
response to the voice inputted from utterance of the user who inputted from the 
microphone 23^ dialogue hysteresis, etc. The dialogue Management Department 
110 generates the response pattern to the inputted voice based on the various- 
dialogue regulations memorized by the dialogue regulation table 111. 



[0051] The speech recognition section 120 changes a user's utterance into the 
frame for the format which can be processed at the dialogue Management 
Department 110, for example, text format, syntax analysis, and a dialogue etc. 
Specifically, the speech recognition section 120 consists of the dictionary 121 for 
speech recognition, the sound model 122, a language model 123, and 
sonagraphy section 124 grade. In the sonagraphy section 124, the extract of 
characteristic quantity required for recognition is performed with a very small 
time interval. For example, the energy of the acquired sound signal, the number 
of zero crossovers, a pitch, frequency characteristics, such variation, etc. are 
extracted. Linear predictive coding (LPC), a fast Fourier transform (FFT), a band 
pass filter (BPF), etc. are used for a frequency analysis. 

[0052] The speech recognition section 120 determines the word sequence 
corresponding to the characteristic quantity sequence generated in the 
sonagraphy section 124 using the sound model 122 and the language model 
123. As the recognition technique, a hidden Markov model (it is described as 
HMM below Hidden Markov Model:.) etc. is used, for example. 
[0053] In HMM, it is a state-transition model with state transition probability and a 
probability density function, and changing a condition, the probability value 
which outputs a characteristic quantity sequence is accumulated, and likelihood 
is determined. It is the technique used for matching with the method of the 



pronunciation given to the alphabetic character recognized in the method of the 
pronunciation of the word memorized by using the value of the likelihood as a 
"score" by the dictionary for speech recognition, and the image-processing 
character recognition section mentioned later. Transition probability, a 
probability density function, etc. of HMM are a value which leads, learns like the 
study fault based on the data for study beforehand, and is prepared. 
[0054] A sound model can prepare a phoneme (PLU), syllable, a word, a phrase, 
a sentence, etc. for every unit, for example, Japanese kana "**" - " — it is " 
obtaining " - obtaining -"- " -"- " - it is --"- " - coming --"-- when the sound 
model which makes "**" a unit is used, the words of "yes", no [ "no" ], "good 
morning", whether "whether to be ****** now", etc. can be constituted by 
connecting combining these. A phoneme expresses the pronunciation 
information on a word and is an acoustical and phonological unit. On these 
specifications, it is used without distinguishing a phoneme and PLU 
(Phonone-like unit). The pronounced voice can surely be expressed as a 
combination (PLU train) of a phoneme (PLU). 

[0055] According to HMM, similarity with the characteristic quantity sequence of 
the voice acquired in the language and the microphone 23 which were 
constituted in this way is calculable as a score. As information for constituting 
"language" from a sound model, the language model 123 and the dictionary 121 



for speech recognition are used, in the dictionary 121 for speech recognition, it is 
the dictionary in which the method of connection of the sound model (here - the 
single character "**" of a kana - " - it is - " ... etc. is shown.) for constituting- 
each word used as the candidate for recognition was shown as a 
correspondence table, and the language model 123 shows the regulation of the 
method of connection between a word and a word. 

[0056] In the example shown below, in case a "word" is pronounced on 
recognition processing, it shows the thing with it more convenient [ a unit / treat / 
as one settlement ], and is not necessarily in agreement with a linguistic word. 
For example, although "North Shinagawa" may be treated as one word in the 
following examples, this may be treated as the two words "north" and 
"Shinagawa." Furthermore, it can also treat as one word when pronouncing a 
"North Shinagawa station" and "where a North Shinagawa station to be." 
[0057] Moreover, on these specifications, it uses as mind of the hiragana which 
wrote the "reading kana", or katakana, the "method of pronunciation" is written 
using a Roman alphabet or a Roman alphabet, and a notation, and it is 
equivalent to the "phoneme notation" which can be set linguistically. [ the 
reading of the kanji and an alphabetic word ] [ the actual pronunciation of a~ 
reading kana ] 

[0058] For example, the case where the sentence "the time of - from the time of 



-" is treated is considered, in this case - first - "0 (****)" "1 (****)" ... the word "24 
(**********)", and "time (**)" - " - since - " the method of connection of a word 
is determined by the language - "until" being alike, respectively, and being 
related and referring to the sound model 122. 

[0059] next, "(word showing figure)" and a "time" — " - since ~ " — the method of 
connection of each word for constituting a sentence is determined by referring to 
the language model 123 for each word until [ "until" ] "(word showing figure)", 
and a "time." 

[0060] By applying HMM using this dictionary 121 for speech recognition, and 
the language model 123, similarity with the characteristic quantity sequence 
inputted as the sentences "from 2:00 to 5:00" "from 1:00 to 2:00" can calculate 
as a score. The sentence which consists of a word sequence which has the- 
highest score in it is outpiitted as a speech recognition result. 
[0061] Count of the score in speech recognition processing may be performed 
by carrying out comprehensive evaluation of the acoustical score given with the 
sound model 122, and the linguistic score given with the language model 123. 
[0062] A linguistic score is a score given based on the transition probability or 
the chain probability between n continuous words, for example. Transition 
probability is the value beforehand calculated statistically from a lot of texts, and 
calls this transition probability "n g" here. 



[0063] In addition, a language model may describe the class (what classified the 
word according to a certain criteria and attribute) of a word also besides 
describing a word directly in syntax or n g. 

[0064] for example, — describing the syntax "whether < name of a place >+ is + 
which is + where when the words showing the name of a place are collected 
and the class name the <name of a place> is conferred upon it **** — the inside 
of n g — < name of a place >+ can also prepare the transition probability of + 
where." In this case, it is n= 3 and transition probability is P (<name of a place> | 
is where |) correctly. 

[0065] The output generation section 130 changes into actual actuation the 
response pattern which the dialogue Management Department 110 generated. 
For example, when the response pattern the dialogue Management Department 
110 "utters with + "no" which shakes a neck at right and left" is generated, the 
output generation section 130 generates the voice wave corresponding to "no", 
and outputs it from a loudspeaker 24 while it generates the pattern of operation 
which corresponds for "shaking a neck at right and left" in response and sends it 
toCPUIO. 

[0066] The image-processing character recognition section 140 identifies the 
character string which takes with CCD camera 20 and is contained in a ****** 
image based on the character-pattern database 141. Image patterns, such as an 



alphabetic character of the word of each country, are stored in the 
character-pattern database 141 a hiragana, katakana, the kanji, the alphabet, 
notations, and if needed. The image-processing alphabetic character 
discernment section 140 matches between the input image from CCD camera 
20, and the image pattern stored in the character-pattern database 141, and 
recognizes the character string contained in the input image. 
[0067] The pronunciation information generation section 150 generates the 
pronunciation information corresponding to the character string recognized in 
the image-processing character recognition section 140, i.e., the reading kana of 
a character string, and generates the method (reading) of the pronunciation 
further. For example, the reading [ case, " came and carry out and / ******" ] kana 
the character string "north Shinagawa" has been recognized to be from the input 
image is generated, and the method (reading) of the pronunciation 
"kitashinagawa" is generated in a PLU train. 

[0068] The word reading attribute table 151 is a table which read with the word 
(character string) and described the group of a kana and an attribute, as shown 
in drawing 4 . The attribute shows the semantics which a word has like the 
"name of a place", an "identifier", and an "animal." 

[0069] When the character string recognized in the image-processing character 
recognition section 140 is contained in this table, it is reading from this table and 



extracting a kana, and the method (reading) of the pronunciation of that 
character string can be decided from a reading kana. The word reading attribute 
table 151 is prepared independently [ the dictionary 121 for speech recognition 
[0070] the number of vocabularies of the dictionary for recognition — the 
convenience on a recognition rate, precision, or processing - an upper limit - it 
is (for example, 65,536 words) - on the word reading attribute table 151, a word 
can be described regardless of those limits. This word reading attribute table 151 
can also be diverted from other language resources. For example, the dictionary 
currently used by the kana kanji conversion program, the morphological analysis 
program, etc. can also be diverted. 

[0071] The alphabetic character reading table 152 is a table on which it read with 
the alphabetic character and correspondence with a kana was described, as 
shown in drawing 6 . It reads for every notation, alphabet, or single kanji, and the 
kana is described. If it reads about all usable alphabetic characters and the kana 
is described, it can read to the character string of arbitration and the method 
(reading) of pronunciation can be given from a kana. 

[0072] The regulation for reading, when the reading grant table 153 is read only 
on two tables and a kana cannot be given, and giving a kana, and the regulatioa- 
for specifying this, when a reading kana cannot be specified are described. For 
example, there are a regulation about phonetic reading and unification of native 



Japanese reading, and prolonged-sound-izing, a regulation of ****, a regulation 
about a repeat, and a regulation that gives reading to an alphabetic word. 
[0073] the regulation specifically concerning prolonged-sound-izing — "... it takes 
- " - "... obtaining - it is - " -- etc. - "... -" - "... it is the regulation which is 
acquired and is changed into -" etc. this regulation - for example, - "- ** - 
obtaining - today - " - "-- ** - it is changed into - ****-." the regulation of **** - 
for example, reading of "the exit of Shinagawa" - "-- when carrying out and 
generating from association with ****** (Shinagawa)" and "**** (opening)", it is 
the regulation which makes "****" muddy and is made into "****." Moreover, it is 
the regulation which reads to repeats, such as "******************" t corresponding 
to the regulation about a repeat, and attaches a kana. Furthermore, the 
regulations which read to an alphabetic word and give a kana are regulations, 
like the "e" itself carries out vowel reading of the front vowel to instead of [ which 
is not pronounced ], when there is "e" in the end of the word of an alphabetic 
word. For example, in case the reading kana "Taegu" is given to "take", it is the 
regulation which gives the reading kana "A" to "a" and only gives the reading 
kana "KU" to "ke." 

[0074] Next, the processing at the time of registering a new word into the 

dictionary for recognition is concretely explained using drawing 7 . 

[0075] First, in step S1, it shifts to the word registration mode for word 



registration, the shift to word registration mode - for example, the "register 
mode" to which a user emits robot equipment 1 and "language - memorizing — " 
— etc. — it shifts to word registration mode by making language into a trigger. In 
addition, a manual operation button is prepared, and when this manual operation 
button is pushed, it may be made to shift to word registration mode. 
[0076] In step S2, directions of the purport which utters the reading of the word 
into which a user wants to register the notation of a word to register in addition to 
the directions and/or presentation of a purport which are shown in front of CCD 
camera 20 of robot equipment 1 are urged to robot equipment 1 to a user. The 
case where the contents of directions are displayed on the display which robot 
equipment 1 may direct with voice, and is not illustrated is sufficient as the 
directions to a user. Here, the word "North Shinagawa" is explained as an 
example. The kanji, a kana, a Roman alphabet notation, or a PLU train is also 
available for the alphabetic character presented by the user, concrete - robot 
equipment 1 - "North Shinagawa" — "~ it causes and any notations, such as- 
******", "KITASH I N AGAWA", and "kitashinagawa", can be recognized. 
[0077] In step S3, robot equipment 1 judges whether it is only alphabetic 
character presentation or there was any utterance with alphabetic character 
presentation. Only in alphabetic character presentation, when it progresses to 
step S4 and there is utterance with alphabetic character presentation, it 



progresses to step S8 mentioned later. Only in utterance except it, recognition 
processing by the garbage model is performed as usual. 

[0078] First, the case of only alphabetic character presentation is explained. 
Only in alphabetic character presentation, in step S4, the image-processing 
character recognition section 140 in robot equipment 1 carries out character 
recognition (OCR:Optical Charactor Recognition) of what kind of character string 
is contained in the image picturized in CCD camera 20 based on the 
character-pattern database 141. here, the candidate of a character recognition 
result needs to narrow down the image-processing character recognition section 
140 to one - when there is nothing, it leaves two or more candidates. For 
example, when the recognition result of '•******" is obtained to the alphabetic 
character "North Shinagawa", it leaves "******." 

[0079] Then, in step S5, the pronunciation information generation section 150 in 
robot equipment 1 generates the method (reading) of the pronunciation of a 
character string to the character string obtained as a recognition result of step 
S4. The detail at the time of generating pronunciation is mentioned later. The 
method (reading) of pronunciation is given to a character string by pronunciation^ 
generation processing. When there are two or more recognized character strings, 
and/or when the method of two or more pronunciation is possible to one 
character string, all pronunciation patterns are applied. 



[0080] In step S6, robot equipment 1 checks to a user which the method 
(reading) of the pronunciation to the character string generated as mentioned 
above should adopt among two or more reading [ be / it / the right ]. In a general 
case, the method (reading) of pronunciation asks a question as if "reading is the_ 
right in OO." When a user returns the "right" and the response of "yes" etc., it 
progresses to step S7. 

[0081] Moreover, when there are two or more kinds of methods (reading) of 
pronunciation, a question is asked as if [ each ] "reading is OO." A user adopts 
the reading which returned the "right" and the response of "yes" etc., and 
progresses to step S7. 

[0082] When the response of "no" etc. is received from a user (i.e., when right 
reading does not exist), it returns to processing of step S2 or step S4. 
[0083] By the above processing, after deciding reading of a new word, the 
method (reading) of the pronunciation to the character string which progressed 
to step S7 and was acquired, and this character string is matched, and it 
registers with the dictionary for recognition as a new word. In case a new word is 
added, the recognition result of the shown alphabetic character is used for the 
word symbol column shown in drawing 4 . The method (reading) of the 
pronunciation decided in step S6 is described by PLU **** corresponding to thia__ 
character string. Register mode is ended after registering a new word. Then, the 



processing for making the updated dictionary for recognition reflect in speech 
recognition, for example, the reboot of a speech recognition program etc., is 
performed. 

[0084] The case where the alphabetic character written in step S3 on the other 
hand while the user presented the alphabetic character is uttered is explained. 
When alphabetic character presentation has utterance, pronunciation, 
information, such as a PLU train, can be generated with a sufficient precision by 
using cooperatively the information acquired from both. 

[0085] Specifically, two or more reading kanas presumed from two or more 
alphabetic characters presumed from the result of character recognition and 
each [ these ] alphabetic character and the method (reading) of the 
pronunciation corresponding to each reading kana are generated. Thus, by 
matching the method (reading) of two or more obtained pronunciation, and 
utterance from a user acquired in the microphone 23, one reading kana and the 
method (reading) of pronunciation are specified out of two or more candidates 
generated as mentioned above. 

[0086] When there is utterance with alphabetic character presentation, in step 
S8, character recognition of the image-processing character recognition section 
140 in robot equipment 1 is carried out from the image picturized in CCD camera 
20. here, the candidate of a character recognition result needs to narrow down 



the image-processing character recognition section 140 to one — when there is 
nothing, it leaves two or more candidates. 

[0087] Then, in step S9, the pronunciation information generation section 150 in 
robot equipment 1 generates the reading kana of a character string to the 
character string obtained as a recognition result of step S8. The method 
(reading) of pronunciation is given to a character string by pronunciation 
generation processing. When there are two or more recognized character strings, 
when two or more reading is possible, all pronunciation patterns are applied tQ_ 
one character string. 

[0088] Next, in step S10, the temporary dictionary for recognition is temporarily 
generated from a character string and the method (reading) of pronunciation. 
This dictionary is hereafter described as the dictionary for recognition for new 
words. For example, suppose that the alphabetic character "North Shinagawa" 
picturized by CCD camera 20 has been recognized by two kinds, "North 
Shinagawa" and "******" f j n the image-processing character recognition section 
140. The speech information generation section 150 is read to "North 
Shinagawa" and "******", and gives a kana. "North Shinagawa" - "- it causes, 
******" is given, two kinds, "** vacancies are **", are given to "******", and it is 
generated further, the method (reading), i.e., the PLU train, of both pronunciation. 
[ "******** is **" and ] The dictionary for recognition for new words in this case is 



shown in drawing 8 . 

[0089] In step S11, speech recognition is performed to utterance from a user 
using the dictionary for recognition for new words. Speech recognition here is 
not a continuous speech recognition but word speech recognition. When the 
user has spoken before rather than the dictionary for recognition for new words 
is generated, the utterance is recorded and speech recognition is performed to 
the sound recording voice. The speech recognition in step S11 is discovering a 
user's utterance and the acoustical nearest word out of the word registered into 
the dictionary for recognition for new words. However, in processing of step S1 1 , 
even if a word symbol is the same, when PLU trains differ, it is regarded as- 
another word. 

[0090] in drawing 8 , it is utterance of the user out of three words (two "******" 
regards it as another word) registered here — "— it is causing and discovering the 
word nearest to ******." As a result, the group of a word symbol and a PLU train 
can be specified as one. 

[0091] If the group of a word symbol and a PLU train is specified out ot the 
dictionary for recognition for new words, this will be registered into the dictionary 
121 for speech recognition of normal in step S7. Register mode is ended after 
registering a new word. Then, the processing for making the updated dictionary 
for recognition reflect in speech recognition, for example, the reboot of a speech 



recognition program etc., is performed. 

[0092] By processing shown above, robot equipment 1 can register the word 
which is not memorized by the dictionary 121 for speech recognition as a new 
word. 

[0093] Generation of the method (reading) of the pronunciation of the characteT" 
string in step S5 and step S9 which were mentioned above is explained to a 
detail using drawing 9 . 

[0094] First, in step S21, it investigates whether the character string recognized 
by the image-processing character recognition section 140 consists of only kana 
alphabetic characters. However, with a kana alphabetic character here, a 
macron a repeat "**- — ", etc. are included other than a hiragana and 
katakana. When the character string consists of only kana alphabetic characters, 
let the recognized kana alphabetic character be the reading of the character 
string in step S22. At this time, the pronunciation of prolonged-sound-izing etc. 
may be corrected a little. 

[0095] On the other hand, when the character string recognized by the 
image-processing character recognition section 140 contains alphabetic 
characters other than a kana alphabetic character in step S21, in step S23, it 
distinguishes whether the character string is contained in the word reading 
attribute table 151. 



[0096] When the character string is contained in the word reading attribute table 
151, it reads from the table, a kana is acquired, and the method (reading) of 
pronunciation is generated further (step S24). Moreover, when the attribute of a 
word is described by the word reading attribute table 151, an attribute is also 
acquired to coincidence. About the usage of this attribute, it mentions later. 
[0097] When the character string is not contained in the word reading attribute 
table 151 , in step S25, it reads combining the reading grant based on the longesT 
match principle and the division minimum method, and the alphabetic character 
reading table 152, and the reading grant based on a reading grant regulation, 
and a kana is acquired. 

[0098] They are whether the same thing as an input string can be constituted 
from combining with the longest match principle and the number-of-partitions 
minimum method two or more words contained in the word reading attribute 
table 151, and the approach of trying, as a result if "North Shinagawa" and a 
"station front" are included even if this is not contained in the word reading 
attribute table 151 when an input string is "in front of a North Shinagawa station", 
since the "North Shinagawa station front" can be constituted from such 
combination - it causes and the reading" before ********** can be acquired, 
the direction where a longer word is contained when there are two or more kinds 
of configuration approaches — giving priority (longest match principle) — the 



direction which can be constituted from fewer words — giving priority (the 
number-of-partitions minimum method) - it carries out and the configuration 
approach is chosen. 

[0099] Moreover, the reading grant based on the alphabetic character reading 
table 152 divides a character string for every alphabetic character, and is the 
approach of reading from the alphabetic character reading table 152 for every 
alphabetic character, and acquiring a kana which divided. Since two or more 
reading kanas can be given to one kanji in the case of the kanji, the reading kana 
as the whole character string becomes the combination of the reading kana of- 
each kanji. Therefore, for example, it is the approach of reducing the number of 
combination using the regulation of "phonetic reading and native Japanese 
reading cannot be intermingled easily." 

[0100] Then, in step S26, a score or reliability is calculated to the candidate of 
each reading kana acquired by the above-mentioned all directions method, and 
a high thing is chosen. This reads to the inputted character string and a kana can 
be given. The method (reading) of pronunciation is generated from the obtained 
reading kana. 

[0101] After passing through each process of step S22, step S24, and step S26, 
finally in step S27, the method (reading) of the pronunciation to a reading kana is 
corrected based on regulations, such as prolonged-sound-izing and ****-izing. 



[0102] Here, the word reading attribute table 151 is explained to a detail. Only by 
newly registering a word into the dictionary 121 for speech recognition, the 
connection regulation between the words recorded on the language model 123 
is inapplicable. For example, even if it carries out additional registration of "North 
Shinagawa" at the dictionary 121 for speech recognition, the chain probability of 
the syntax or "North Shinagawa" about "North Shinagawa", and other words etc. 
is not generated only by it. Therefore, ideally, the method of making the 
connection regulation of a language model reflect in a new registration word 
adds syntax, or recalculates a chain probability from text data, and although it is 
reconstituting a language model, a language model is applicable by the simple 
approach shown below after new registration. 

[0103] First, the class name of a <unknown word> is attached to the word which 
is not contained in the language model. To the language model, the chain 
probability of a <unknown word> and other words is described. It considers that 
a new registration word is a <unknown word>, and it calculates the chain 
probability of this new registration word and other words from the chain 
probability of a <unknown word> and other words. 

[0104] With a class, a word is classified according to a certain criteria and 
attribute. For example, it classifies according to semantics, and each is named 
the <name of a place>, a <family name>, and the <name of a country>, or it 



classifies according to a part of speech, and each is named a <noun>, a <verfc», 
and a <adjective>. 

[0105] To a language model, the chain probability between classes and the 
chain probability of a class and a word are described instead of describing the 
chain probability between words. When searching for the chain probability 
between words, it investigates to which class a word belongs, the chain 
probability about the class corresponding to a degree is searched for, and the 
chain probability between words is calculated from there. 

[0106] A class model is applicable by presuming which class it is a word 
belonging to also about a new registration word at the time of registration. 
[0107] When it is made above, in the model for unknown words, the chain 
probability of the same value is altogether given to a new registration word. With 
a class model, it becomes a value which is different to which class it belongs to it. 
Therefore, generally, the linguistic score about a new registration word turns into 
a score with more suitable using a class model, and is recognized appropriately 
as a result. 

[0108] Therefore, in the word registration by speech recognition, the difficult 
class name can input easily conventionally. That is, when the character string 
(word) obtained by character recognition is contained in the word reading 
attribute table 151, a class name can be acquired from the attribute column of 



this table. In addition, in the example shown in drawing 5 , although only one has 
described the attribute in the attribute column, two or more these can also be 
described like "< name of a place >, a <proper noun>, and a <name of the 
station>." When the class the <name of a place> exists in this case, the 
classification name which is in agreement with a class name, i.e., the <name of a 
place>, is adopted out of the <name of a place>, a <proper noun>, and a <name 
of the station>. 

[0109] In character recognition, precision of direction recognized including the 
information about the chain of an alphabetic character may improve rather than 
recognizing a single character every. Then, the precision of character 
recognition can be further improved by using the "word symbol" column of the 
dictionary for recognition, the "word" column of the word reading attribute table 
151, etc. as information about the chain of an alphabetic character. 
[0110] Although the above explanation explained the case of character 
recognition as recognition of the predetermined pattern in an acquisition image- 
The alphabetic character (character string) which recognizes the image of the 
configuration of a body besides an alphabetic character (character string), a 
profile, a shank, and the body itself, and corresponds as mentioned above is 
extracted. The voice wave which gives two or more kinds of reading kanas 
presumed from the extracted alphabetic character, and is equivalent to each of 



two or more kinds of given reading kanas is also generable. In this case, in 
addition to the fundamental configuration shown in drawing 1 , a required^ 
configuration is added if needed. 

[0111] Thus, by enabling it to master the method of pronunciation as a 
predetermined pattern corresponding to various cases besides a character 
string, robot equipment can express signs that information is acquired and 
learned from the exterior, and entertainment nature can be improved. 
[0112] By the way, the robot equipment 1 shown as a gestalt of this operation is 
robot equipment which can act autonomously according to an internal state. The 
software configuration of the control program in robot equipment 1 comes to be 
shown in drawing 10 . As mentioned above, this control program is beforehand 
stored in the flash ROM 12, and is read at the time of the powering-on early 
stages of robot equipment 1 . 

[01 13] In drawing 10 , the device driver layer 30 is located in the lowest layer of a 
control program, and consists of device driver sets 31 which consist of two or 
more device drivers. In this case, each device driver is the object allowed to 
carry out direct access to the hardware used by usual computers, such as CCD_ 
camera 20 ( drawing 2 ) and a timer, and processes in response to interruption 
from corresponding hardware. 

[0114] Moreover, the ROBOTIKKU server object 32 With the virtual robot 33 



which becomes by the software group which offers the interface for being 
located in the lowest layer of the device driver layer 30, for example, accessing 
hardware, such as various above-mentioned sensors and Actuators 251 -25n 
With the power manager 34 who becomes by the software group whictL 
manages the change of a power source etc. It consists of a device driver 
manager 35 who becomes by the software group which manages other various 
device drivers, and a dither INDO robot 36 which becomes by the software 
group which manages the device of robot equipment 1 . 

[0115] The manager object 37 consists of an object manager 38 and a service 
manager 39. The object manager 38 is a software group which manages starting 
and termination of each software group contained in the ROBOTIKKU server 
object 32, the middleware layer 40, and the application layer 41, and a service 
manager 39 is a software group which manages connection of each object 
based on the initial entry between each object described by the connection file 
stored in the memory card 28 ( drawing 2 ). 

[01 16] The middleware layer 40 is located in the upper layer of the ROBOTIKKU 
server object 32, and consists of software groups which offer the fundamental 
function of these robot equipments 1 , such as an image processing and speech 
processing. Moreover, the application layer 41 is located in the upper layer of th§_ 
middleware layer 40, and consists of software groups for opting for action of 



robot equipment 1 based on the processing result processed by each software 
group which constitutes the middleware layer 40 concerned. 
[0117] In addition, the concrete software configuration of the middleware layer 
40 and the application layer 41 is shown in drawing 1 1 , respectively. 
[0118] As shown in drawing 11 , the middleware layer 40 For noise detection, 
The object for temperature detection, the object for brightness detection, the_ 
object for scale recognition, the object for distance detection, for posture 
detection, The recognition system 60 which has input semantics converter 
module 59 grade in the object for touch sensors, the object for motion detection, 
and each signal conditioning module 50 for color recognition - 58 lists, It consists 
of output systems 69 which have the object for posture management, the object 
for tracking, the object for motion playback, the object for a walk, the object for a 
fall return, an object for LED lighting, and each signal conditioning module 61 for 
sound playback - 67 grades in output semantics converter module 68 list. 
[0119] Each signal conditioning modules 50-58 of the recognition system 60 
incorporate the data with which it corresponds of each sensor data read from 
DRAM1 1 ( drawing 2 ) by the virtual robot 33 of the ROBOTIKKU server object 
32, or image data and voice data, perform predetermined processing based on 
the data concerned, and give a processing result to the input semantics 
converter module 59. Here, the virtual robot 33 is constituted by the 



predetermined protocol as a part which carries out transfer or conversion of a 
signal. 

[0120] The input semantics converter module 59 It is based on the processing 
result given from each [ these ] signal conditioning modules 50-58. "The fall was 
detected", [ "it is "noisy", hot / "hot" /, and bright", "the ball having been detected", 
and ] The self of "it was stroked", "it having been struck", "the scale of C-E-G 
having been heard", "the body which moves having been detected", "having 
detected the obstruction", etc. and a surrounding situation, the command from a 
user, and influence are recognized, and a recognition result is outputted to the 
application layer 41. 

[0121] The application layer 41 consists of five modules, the behavioral model 
library 70, the action change module 71, the study module 72, the feeling model 
73, and the instinct model 74, as shown in drawing 12 . 

[0122] As shown in drawing 13 , "when a dc-battery residue decreases", "when 
avoiding an obstruction, and expressing feeling", the behavioral model library 70 
is made to correspond to the condition item of the shoes "at the time of detecting 
a ball" etc. chosen beforehand, respectively, and the behavioral model which 
became independent, respectively "is prepared [ "a fall return is carried out" 
and ] in it." 

[0123] And the time of a recognition result being given from the input semantics 



converter module 59, respectively, as for these behavioral models, The 
parameter value of the corresponding emotion currently held like the 
after-mentioned at the feeling model 73 if needed when fixed time amount has 
passed, after the last recognition result is given, It opts for the action which 
continues while referring to the parameter value of the corresponding desire 
currently held at the instinct model 74, respectively, and a decision result is 
outputted to the action change module 71. 

[0124] In the case of the gestalt of this operation, in addition, each behavioral 
model As the technique of opting for the next action As opposed to the arcs 
ARC1-ARCn1 which connect between to each node NODE0 - NODEn for to__ 
other nodes NODE0 of which - NODEn it changes from one node (condition) 
NODE0 as shown in drawing 14 - NODEn The algorithm called the finite 
stochastic automaton determined probable based on the transition probability 
P1-Pn set up, respectively is used. 

[0125] Concretely, each behavioral model is made to correspond to the node 
NODE0 which forms a self behavioral model, respectively - NODEn, respectively, 
and has the state transition table 80 as shown in drawing 15 for every these 
node NODE0 - NODEn. 

[0126] In this state transition table 80, the input event (recognition result) made 
into transition conditions in that node NODE0 - NODEn is listed by the line of an 



"input event name" at a priority, and the further conditions about that transition 
condition are described by the "data name" and the corresponding train in the 
line of the "data range." 

[0127] therefore, in the node NODE100 expressed in the state transition table 80 
of drawing 15 When the recognition result of "detecting a ball (BALL)" is giveo_ 
The range of "magnitude (SIZE)" of the ball given with the recognition result 
concerned is "0 to 1000", When the recognition result of "detecting an 
obstruction (OBSTACLE)" is given, they have been conditions for that the range 
of "the distance (DISTANCE)" to the obstruction done with the recognition result 
concerned is "0 to 100" to change to other nodes. 

[0128] Moreover, in this node NODE100, when there is no input of a recognition 
result, it also sets. The inside of each emotion held at the feeling model 73 and 
the instinct model 74 which a behavioral model refers to periodically, 
respectively, and the parameter value of each desire, it was held at the feeling 
model 73 - "~ glad (Joy) — " - "- surprised (Surprise) ~ " -- or - "-- feeling sad 
(Sadness) — " — when the range of which parameter value is "50 to 100", it can 
change to other nodes. 

[0129] moreover — a state transition table 80 - "— others, while the node name 
which can change from the node NODE0 - NODEn in the train of the "transition 
place node" in the column of transition probability" of NODOHE is listed It is_ 



described by the part where it corresponds in the column of transition probability" 
of NODOHE, respectively, the transition probability to each of other node 
NODEO which can change when all the conditions described by the line of an 
"input event name", a "data name", and the "range of data" are met - NODEn — 
others — the action which should be outputted in case it changes to the node 
NODEO - NODEn - others — it is described by the line of "output action" in the 
column of transition probability" of NODOHE. in addition — others — the suitl 
of the probability of each line in the column of transition probability" of NODOHE 
is 100 [%]. 

[0130] therefore, in the node NODE100 expressed in the state transition table 80 
of drawing 15 for example, when the recognition result that it carries out 
"detecting a ball (BALL)", and the range of "SIZE (magnitude)" of the ball is "0 to 
1000" is given It can change to "a node NODE120 (node 120)" by the probability 
of "30 [%]", and action of "ACTION 1" will be then outputted. 
[0131] When they are constituted as a lot of nodes NODEO described as such 
[ respectively ] a state transition table 80 - NODEn(s) are connected, and a 
recognition result is given from the input semantics converter module 59, each 
behavioral model opts for the next action probable using the state transition table 
of the node NODEO - NODEn, and is made as [ output / to the action change 
module 71 / a decision result ]. 



[0132] The action change module 71 shown in drawing 12 chooses from each 
behavioral model of the behavioral model library 70 the action outputted from the 
high behavioral model of the priority beforehand defined among the actions 
outputted, respectively, and sends out the command (this is hereafter called 
action command.) of the purport which should perform the action concerned to 
the output semantics converter module 68 of the middleware layer 40. In 
addition, in the gestalt of this operation, priority is highly set up for the behavioral 
model written by the bottom in drawing 13 . 

[0133] Moreover, the action change module 71 notifies that the action was., 
completed based on the completion information of action given from the output 
semantics converter module 68 after the completion of action to the study 
module 72, the feeling model 73, and the instinct model 74. 
[0134] On the other hand, the recognition result of the instruction received as 
influence from a user, such as the study module 72 "was struck" among the 
recognition results given from the input semantics converter module 59 and "it 
having been stroked", is inputted. 

[0135] And when the study module 72 "is struck" (scolded) based on the notice 
from this recognition result and the action change module 71, the manifestation 
probability of that action is reduced, and when "stroked" (praised), the transition 
probability to which the corresponding behavioral model in the behavioral model 



library 70 corresponds so that the manifestation probability of that action may be 
raised is changed. 

[0136] on the other hand, the feeling model 73 - glad (Joy) - " - feeling 
sad (Sadness) — " — "- getting angry (Anger) - " — surprised (Surprise) - " — 
"dislike (Disgust)" — and — afraid (Fear) - " - the parameter with which the 
strength of the emotion is expressed for every emotion is held about a total of six 
emotions. And the feeling model 73 updates the parameter value of each 
[ these ] emotion periodically based on the notice from the specific recognition 
result to which it is given from the input semantics converter module 59, 
respectively, such as "it having been struck" and "it having been stroked", and 
elapsed time and the action change module 71 etc. 

[0137] The recognition result to which the feeling model 73 is specifically given 
from the input semantics converter module 59, The amount of fluctuation of the 
action and its emotion when being computed by predetermined operation 
expression based on the elapsed time after updating last time etc. of the robot 
equipment 1 at that time **E [t], The multiplier which expresses the sensibility of 
E [t] and its emotion for the parameter value of the current emotion is set to ke. 
(1) By the formula, as parameter value [ of the emotion in the following period ] E 
[t+1] is computed and this is replaced with parameter value [ of the current 
emotion ] E [t], the parameter value of the emotion is updated. Moreover, the 



feeling model 73 updates the parameter value of all emotions like this. 

[0138] 

[Equation 1] 

E[t + l]=E[t] + kexAE[t] 

[0139] In addition, it is decided beforehand what the notice from each recognition 
result or the output semantics converter module 68 has effect of on amount of 
fluctuation **[ of the parameter value of each emotion ] E [t]. For example, the 
recognition result of "having been struck" has big effect on amount of fluctuation 
**[ of the parameter value of the emotion of the "resentment" ] E [t], and the 
recognition result of "having been stroked" has big effect on amount of 
fluctuation **[ of the parameter value of the emotion of "joy" ] E [t]. 
[0140] Here, the notice from the output semantics converter module 68 is the 
so-called feedback information (the completion information of action) of action, 
and is the information on the appearance result of action, and the feeling model 
73 changes feeling also using such information, this — for example, the feeling 
level of the resentment falls by action of "barking" — like - they are things. In 
addition, the notice from the output semantics converter module 68 is inputted 



also into the study module 72 mentioned above, and the study module 72 
changes the transition probability to which a behavioral model corresponds 
based on the notice. 

[0141] In addition, feedback of an action result may be made with the output 
(action to which feeling was added) of the action change modulator 71. 
[0142] on the other hand, "movement avarice (exercise)", "love avarice 
(affection)", "appetite (appetite)", and "the curiosity (curiosity) of the instinct 
model 74" are mutually-independent — the parameter with which the strength of- 
the desire is expressed for these the desires of every is held about four desires 
the bottom. And the instinct model 74 updates the parameter value of these 
desires periodically based on the recognition result to which it is given from the 
input semantics converter module 59, respectively, the notice from elapsed time 
and the action change module 71 , etc. 

[0143] The instinct model 74 specifically about "movement avarice", "love 
avarice", and "curiosity" The amount of fluctuation of that the desire of the when 
being computed by predetermined operation expression based on the notice 
from a recognition result, elapsed time, and the output semantics converter 
module 68 etc. deltal [k], The parameter value of the current desire as a 
multiplier ki showing the sensibility of I [k] and its desire As parameter value [ of 
that desire in the following period ] I [k+1] is computed using (2) types with a 



predetermined period and this result of an operation is replaced with parameter 
value [ of that current desire ] I [k], the parameter value of that desire is updatecL- 
Moreover, the instinct model 74 updates the parameter value of each desire 
except "appetite" like this. 
[0144] 
[Equation 2] 

l[k+l]«l[k]4-kixAl[k] < Z > 

[0145] In addition, it is decided beforehand what the notice from a recognition- 
result and the output semantics converter module 68 etc. has effect of on 
amount of fluctuation **[ of the parameter value of each desire ] I [k], for example, 
it has effect to amount of fluctuation **[ of the parameter value of the "fatigue" ] I 
[k] with the big notice from the output semantics converter module 68. 
[0146] In addition, in the gestalt of this operation, it is regulated so that each 
emotion and the parameter value of each desire (instinct) may be changed in the 
range from 0 to 100, respectively, and the value of multipliers ke and ki is also 
set up according to the individual for each [ an emotion and ] the desire of every. 
[0147] On the other hand, abstract action commands, such as it being [ which is 
given from the action change module 71 of the application layer 41 as mentioned 



above ] "advance", "it being glad", the output semantics converter module 68 of 
the middleware layer 40 "cries", as shown in drawing 11 , or "tracking (a ball is 
pursued)", are given to the signal conditioning modules 61-67 with which the 
output system 69 corresponds. 

[0148] And these signal conditioning modules 61-67 The servo command value 
which should be given to the actuators 251-25n ( drawing 2 ) in order to carry out 
the action based on the action command concerned, if an action command is 
given, Or the drive data given to LED of a "eye" are generated, the voice data of 
the sound outputted from a loudspeaker 24 ( drawing 2 ) - and — Sequential 
sending out of these data is carried out at the actuators 251 -25n which 
correspond through the virtual robot 33 and digital disposal circuit 14 ( drawing 
2 ) of the ROBOTIKKU server object 32 one by one, a loudspeaker 24, or LED.~" 
[0149] Thus, robot equipment 1 can perform the situation of self (interior) and a 
perimeter (exterior), the directions from a user, and autonomous action 
according to influence based on a control program. Therefore, since 
character-recognition processing mentioned above performs, the 
character-recognition processing shown in drawing 7 can perform by making the 
control program for performing processing which determines the method of the 
pronunciation of the alphabetic character extracted from the image by character 
recognition processing based on the voice recognized from the surrounding 



sound by speech-recognition processing also to the robot equipment which is 
not equipped with the program read. 

[0150] Such a control program is offered through the record medium recorded in 
the format which robot equipment can read. As a record medium which records a 
control program, the record medium (for example, a magnetic tape, a floppy 
(trademark) disk, a magnetic card) of a magnetic reading method, the record 
medium (for example, CD-ROM, MO, CD-R, DVD) of an optical reading method, 
etc. can be considered. Storages, such as semiconductor memory (the so-called 
memory card (configurations, such as a rectangle mold and a square mold, are 
not asked.), IC card), are also contained in a record medium. Moreover, a control 
program may be offered through the so-called Internet etc. 
[0151] It is reproduced through the reading driver equipment of dedication, or a 
personal computer, and these control programs are transmitted and read into 
robot equipment 1 by the cable or wireless connection. Moreover, robot 
equipment can also read a control program from these storages directly, when it 
has drive equipment of miniaturized storages, such as semiconductor memory 
or an IC card. With robot equipment 1, it can read from a memory card 28. 
[0152] In addition, as for this invention, it is needless to say for various 
modification to be possible in the range which is not limited only to the gestalt of 
operation mentioned above and does not deviate from the summary of this 



invention. With the gestalt of this operation, although the robot equipment of 
quadrapedalism was explained, robot equipment may be a 2-pair-of-shoes walk, 
and a migration means is not further limited to leg formula move mode. 
[0153] Below, the detail of the humanoid robot equipment shown as a gestalt of 
another operation of this invention is explained. Signs that humanoid robot 
equipment 200 was viewed from each of the front and back are shown in 
drawing 16 and drawing 17 . Furthermore, the joint degree-of-freedom 
configuration which this humanoid robot equipment 200 possesses is typically 
shown in drawing 18 . 

[0154] As shown in drawing 16 , humanoid robot equipment 200 consists of the 
truncus sections which connect an upper extremity including two arms and a 
head 201, the membrum inferius which consists of the two legs which realize 
migration actuation, and an upper extremity and the membrum inferius. 
[0155] The neck joint which supports a head 201 has three degrees of freedom 
called the neck joint yawing axis 202, the neck joint pitching axis 203, and the 
neck joint roll axes 204. 

[0156] Moreover, each carpus consists of the shoulder-joint pitching axis 208, 
the shoulder-joint roll axes 209, the overarm yawing axis 210, the elbow-joint 
pitching axis 211, the forearm yawing axis 212, a wrist joint pitching axis 213, a 
wrist joint roll ring 214, and a hand part 215. Hand parts 215 are the many joints 



and the multi-degree-of-freedom structure containing two or more fingers in fact. 
However, since there are little the contribution and effect to attitude control or 
walk control of humanoid robot equipment 200, actuation of a hand part 215 is 
assumed to be a zero degree of freedom on these specifications. Therefore, 
each arm presupposes that it has seven degrees of freedom. 
[0157] Moreover, the truncus section has three degrees of freedom called the~ 
truncus pitching axis 205, the truncus roll axes 206, and the truncus yawing axis 
207. 

[0158] Moreover, each leg which constitutes the membrum inferius consists of 
the hip joint yawing axis 216, the hip joint pitching axis 217, the hip joint roll axes 
218, the knee-joint pitching axis 219, an ankle joint pitching axis 220, ankle joint 
roll axes 221, and a foot 222. In this specification, the intersection of the hip joint 
pitching axis 217 and the hip joint roll axes 218 defines the hip joint location of 
humanoid robot equipment 200. In fact, although the foot 222 of the body is the 
structure containing the vola of many joints and many degrees of freedom, the 
vola of humanoid robot equipment 200 makes it a zero degree of freedom. 
Therefore, each leg consists of six degrees of freedom. 

[0159] If the above is summarized, as the humanoid robot equipment 200 whole, 
it will have 2= 3+7x2+3+6x32 degree of freedom in total. However, the humanoid 
robot equipment 200 for entertainment is not necessarily limited to 32 degrees of 



freedom. It cannot be overemphasized that a degree of freedom, i.e., the 
number of joints, can be suitably fluctuated according to a constraint, 
requirement specification, etc. on a design / work. 

[0160] The degree of means is mounted using an actuator in fact each one 
which humanoid robot equipment 200 which was mentioned above has. As for 
the request of eliminating an excessive swelling by the exterior and making it 
approximate in the shape of [ human ] a natural bodily shape, performing attitude 
control to the unstable structure called a 2-pair-of-shoes walk to an actuator, it is_ 
desirable that it is small and lightweight. 

[0161] The control-system configuration of humanoid robot equipment 200 is 
typically shown in drawing 19 . As shown in this drawing, humanoid robot 
equipment 200 consists of each device unit 230,240,250 R/L and 260 R/L 
expressing the human limbs, and a control unit 280 which performs adaptive 
control for realizing coordination actuation between each device unit (however, 
each of R and L is a suffix which shows each of the right and the left.), the 
following -- the same . 

[0162] Actuation of the humanoid robot equipment 200 whole is controlled by the 
control unit 280 in generalization. A control unit 280 consists of circumference 
circuits 282 including the interface (neither is illustrated) which performs the data 
of the main control section 281 which consists of main circuit components (not 



shown), such as CPU (Central Processing Unit) and memory, and each 
component of a power circuit or humanoid robot equipment 200, and transfer of 
a command. Especially the installation of this control unit 280 is not limited. 
Although carried in the truncus section unit 240 in drawing 19 , you may carry in 
the head unit 230. Or a control unit 280 is arranged out of humanoid robot 
equipment 200, and you may make it communicate with the airframe of 
humanoid robot equipment 200 by the cable or wireless. 

[0163] Each joint degree of freedom in the humanoid robot equipment 200 
shown in drawing 19 is realized by the actuator corresponding to each. That is, 
the neck joint yawing axis 202, the neck joint pitch 203, the neck joint 
yawing-axis actuator A2 expressing each of the neck joint roll axes 204, neck 
joint pitching-axis actuator A3, and neck joint roll-axes actuator A4 are arranged 
in the head unit 230. 

[0164] Moreover, the CCD (ChargeCoupled Device) camera for picturizing an 
external situation is formed in the head unit 230, and also the touch sensor for 
detecting the loudspeaker for outputting the microphone for collecting the 
distance robot for measuring the distance to the body located ahead and alien 
frequencies and voice and the pressure received by "it strokes" and the physical 
influence of "striking" from a user etc. is arranged. 

[0165] Moreover, truncus pitching-axis actuator A5 expressing each of the 



truncus pitching axis 205, the truncus roll axes 206, and the truncus yawing axis 
207, the truncus roll-axes actuator A6, and the truncus yawing-axis actuator A7 
are arranged in the truncus section unit 240. Moreover, the truncus section unit 
240 is equipped with the dc-battery used as the starting power source of this 
humanoid robot equipment 200. This dc-battery is constituted by the cell in 
which charge and discharge are possible. 

[0166] Moreover, although subdivided by overarm unit 251 R/L, elbow-joint unit 
252 R/L, and forearm unit 253 R/L, arm unit 250 R/L The shoulder-joint 
pitching-axis actuator A8, shoulder-joint roll-axes actuator A9 which the 
shoulder-joint pitching axis 8, the shoulder-joint roll axes 209, the overarm 
yawing axis 210, the elbow-joint pitching axis 211, the forearm yawing axis 212, 
the wrist joint pitching axis 213, and the wrist joint roll axes 214 express 
respectively, The overarm yawing-axis actuator A10, the elbow-joint 
pitching-axis actuator A11, the elbow-joint roll-axes actuator A12, the wrist joint 
pitching-axis actuator A13, and the wrist joint roll-axes actuator A14 are 
arranged. 

[0167] Moreover, although subdivided by femoral region unit 261 R/L, knee unit 
262 R/L, and leg part unit 263 R/L, leg unit 260 R/L The hip joint yawing-axis 
actuator A16, the hip joint pitching-axis actuator A17 expressing each of the hip 
joint yawing axis 216, the hip joint pitching axis 217, the hip joint roll axes 218^ 



the knee-joint pitching axis 219, the ankle joint pitching axis 220, and the ankle 
joint roll axes 221, The hip joint roll-axes actuator A18, the knee-joint 
pitching-axis actuator A19, the ankle joint pitching-axis actuator A20, and the 
ankle joint roll-axes actuator A21 are arranged, the actuator A2 used for each 
joint, and A3 — it can constitute from a small AC servo actuator of the type 
which ... one-chip-ized the **** servo control system by the gear direct 
attachment type more preferably, and was carried in the motor unit. 
[0168] Sub control section 235,245,255 R/L of an actuator drive control section 
and 265 R/L are arranged for every device unit, such as the head unit 230, the 
truncus section unit 240, the arm unit 250, and each leg unit 260. furthermore, 
each leg 260 - while equipping with the touch-down check sensors 291 and 292 
which detect whether the vola of R and L was implanted, the attitude sensor 293 
which measures a posture is equipped in the truncus section unit 240. 
[0169] The touch-down check sensors 291 and 292 consist of proximity sensors 
or micro switches etc. which were installed in the vola. Moreover, an attitude 
sensor 293 is constituted by the combination of an acceleration sensor and a 
gyroscope sensor. 

[0170] the output of the touch-down check sensors 291 and 292 — during 
periods of operation, such as a walk and transit, — setting — each leg on either 
side -- the present basis or **** — it can distinguish whether it is in which 



condition. Moreover, the inclination and posture of a truncus part are detectable 
with the output of an attitude sensor 293. 

[0171] The main control section 281 can answer the output of each sensors 
291-293, and can amend control objectives dynamically. Accommodative control 
is performed to each of sub control section 235,245,255 R/L and 265 R/L, and, 
more specifically, the upper extremity of humanoid robot equipment 200, the 
truncus, and the membrum inferius can realize the exercise-of-the-whole-body 
pattern driven in cooperation. 

[0172] the exercise of the whole body on the airframe of humanoid robot 
equipment 200 - a foot ~ while setting up movement, a ZMP (ZeroMoment 
Point) orbit, truncus movement, upper extremity movement, lumbar part height, 
etc., the command which directs actuation according to these contents of a 
setting is transmitted to each sub control section 235,245,255 R/L and 265 R/L. 
And in each sub control sections 235 and 245 and the receiving command 
from the main control section 281 is interpreted, and a drive control signal is 
outputted to each actuator A2, A3, etc. "ZMP" here is a point on the floor line 
where the moment by the floor reaction force during a walk serves as zero, and 
a "ZMP orbit" means the locus to which ZMP moves during the walk actuation 
period of humanoid robot equipment 200. 

[0173] These moments act on gravity, inertial force, and a list from a walk 



system at a road surface with the acceleration produced in connection with 
gravity and locomotion at the time of a walk. According to so-called "d'Alemberfs 
principle", they balance with the floor reaction force as reaction from a road 
surface to a walk system, and the floor-reaction-force moment. As a conclusion 
of dynamic inference, the point (Zero Moment Point), i.e., "ZMP", that a pitch and 
the roll-axes moment serve as zero exists in the vola grounding point and side 
top of the support polygon which a road surface forms, or its inside. 
[0174] Many of proposals about posture stability control of a leg formula mobile 
robot or the fall prevention at the time of a walk are used as a norm of stability 
distinction of a walk of this ZMP. The 2-pair-of-shoes walk pattern generation 
based on a ZMP norm can set up the point landing [ vola ] beforehand, and has- 
the advantage of being easy to take the kinematic constraint of the tip of a foot 
according to a road surface configuration into consideration. Moreover, since 
making ZMP into a stability distinction norm means treating not the force but an 
orbit as desired value on kinematic control, feasibility increases technically. In 
addition, the point which applies ZMP to the stability distinction norm of a bipedal 
robot at the conceptual list of ZMP is indicated by Miomir Vukobratovic work 
"LEGGED LOCOMOTION ROBOTS" (work outside Ichiro Kato "a bipedal robot 
and an artificial guide peg" (Nikkan Kogyo Shimbun)). 

[0175] Generally, robots of a 2-pair-of-shoes walk like a humanoid have a high 



center-of-gravity location, and the ZMP stable zone at the time of a walk is 
narrower than quadrapedalism. Therefore, the problem of the posture fluctuation 
accompanying change of such a road surface condition is divided in a 
2-pair-of-shoes bipedal robot, and becomes important. 

[0176] as mentioned above, humanoid robot equipment 200 — each sub control 
sections 235 and 245, etc. - the receiving command from the main control 
section 281 - interpreting — each actuator A2 and A3 — a drive control signal is 
outputted to ... and the drive of each unit is controlled. Thereby, humanoid robot 
equipment 200 is stabilized into a target posture, changes and can be walked 
with the stable posture. 

[0177] Moreover, in the control unit 280 in humanoid robot equipment 200, 
various sensors, such as an acceleration sensor, a touch sensor, and a 
touch-down check sensor, and the image information from a CCD camera, the- 
speech information from a microphone, etc. are generalized other than attitude 
control which was mentioned above, and it is processing. In the control unit 280, 
although not illustrated, various sensors, such as an acceleration sensor, a 
gyroscope sensor, a touch sensor, a distance robot, a microphone, and a 
loudspeaker, each actuator, the CCD camera, and the dc-battery are connected 
with the main control section 281 through the hub which corresponds 
respectively. 



[0178] The main control section 281 incorporates sensor data, and the image 
data and voice data which are supplied from each above-mentioned sensor one 
by one, and carries out sequential storing of these through an internal interface 
in the predetermined location in DRAM, respectively. Moreover, the main control 
section 281 incorporates the dc-battery residue data showing the dc-battery 
residue supplied from a dc-battery one by one, and stores this in the 
predetermined location in DRAM. Each sensor data stored in DRAM, image data, 
voice data, and dc-battery residue data are used in case the main control section 
281 performs motion control of this humanoid robot equipment 200. 
[0179] At the time of the first stage when the power source of humanoid robot 
equipment 200 was switched on, the main control section 281 reads a control 
program, and stores this in DRAM. Moreover, the main control section 281 
judges the situation of self and a perimeter, the existence of the directions from a 
user, and influence, etc. from the main control section 281 based on each 
sensor data by which sequential storing is carried out, image data, voice data^ 
and dc-battery residue data to DRAM as mentioned above. Furthermore, the 
main control section 281 makes humanoid robot equipment 200 take actions, 
such as the so-called "gesture" and a "gesture", by making a required actuator 
drive based on the decision result concerned while opting for action according to 
a self situation based on the control program stored in this decision result and 



DRAM. 

[0180] Therefore, humanoid robot equipment 200 judges the situation of self and_ 
a perimeter based on a control program, and can act autonomously according to 
the directions from a user, and influence. Moreover, humanoid robot equipment 
200 matches and determines the reading presumed from the alphabetic 
character from which the method (reading) of the pronunciation of the alphabetic 
character extracted from the image picturized in the CCD camera was extracted, 
and the voice collected in the sound-collecting microphone. Therefore, the 
precision of the speech recognition of humanoid robot equipment 200 improves, 
and a new word can register with the dictionary for speech recognition. 
[0181] 

[Effect of the Invention] As explained to the detail above, the robot equipment 
concerning this invention A storage means for speech recognition by which 
correspondence relation with the method of the pronunciation of a word and this 
word was memorized as a dictionary for speech recognition, A word phonetic 
storage means by which correspondence relation with the phonogram of a word 
and this word was memorized as a word phonetic table, An image pick-upL 
means to picturize a photographic subject, and an image recognition means to 
extract the image of a predetermined pattern from the image picturized in the 
image pick-up means, A sound-collecting means to acquire a surrounding sound, 



and a speech recognition means to recognize voice from the sound acquired in 
the sound-collecting means, Two or more kinds of phonograms presumed from 
the image of the predetermined pattern extracted in the image recognition 
means are given based on a word phonetic table. A pronunciation information- 
generation means to generate the method of pronunciation, and the voice wave 
equivalent to pronunciation to each of two or more kinds of given phonograms, 
The voice wave of the voice recognized in each voice wave and speech 
recognition means which were generated in the pronunciation information 
generation means is compared, and the dictionary for speech recognition is 
equipped with a storage control means to memorize newly noting that it is the 
method of the pronunciation of an alphabetic character which extracted the 
nearest voice wave. 

[0182] The robot equipment concerning this invention gives two or more kinds of 
phonograms presumed from the image of the predetermined pattern extracted 
from the image picturized in the image pick-up means based on a word phonetic 
table. The method of pronunciation and the voice wave equivalent to 
pronunciation are generated to each of two or more kinds of given phonograms. 
It determines noting that it is the method of the pronunciation of an alphabetic 
character which compared the voice wave of the voice recognized in each voice- 
wave and speech recognition means which were generated in the pronunciation 



information generation means, and extracted the nearest voice wave. 
[0183] Therefore, according to the robot equipment concerning this invention, 
the bad influence by incorrect recognition of the pronunciation which contains 
weak phonemes (for example, /s/of the initial of the word etc.) especially, change 
of the input phoneme under the effect of a surrounding noise, failure in voice 
section detection, etc. is inhibited, and the recognition precision at the time of_ 
registering a new word can be improved. Since the method of exact 
pronunciation can memorize in the dictionary for speech recognition by this, the 
recognition precision at the time of recognizing the word registered as a new 
word improves.. 

[0184] Moreover, the robot equipment concerning this invention is equipped with 
a word information storage means by which word information including the 
phonogram and word attribute of a word and this word was memorized as a 
word attribute table, with the method of the pronunciation of the alphabetic 
character which a storage control means memorizes newly, and this alphabetic 
character, makes a word attribute correspond and memorizes it in the dictionary 
for speech recognition. 

[0185] Therefore, while according to the robot equipment concerning this 
invention it becomes unnecessary for a user to input the word attribute 
information which is needed when applying the syntax rule, a dialogue regulation, 



etc. to the inputted voice and the voice to output and convenience improves, 
when a user does not know attribute information, un-arranging [ that attribute 
information was not able to be inputted ] is improved. 

[0186] Moreover, a storage means for speech recognition by which, as for the 
character reader concerning this invention, correspondence relation with the 
method of the pronunciation of a word and this word was memorized as a 
dictionary for speech recognition, A word phonetic storage means by which 
correspondence relation with the phonogram of a word and this word was- 
memorized as a word phonetic table, An image pick-up means to picturize a 
photographic subject, and an image recognition means to extract the image of a 
predetermined pattern from the image picturized in the image pick-up means, A 
sound-collecting means to acquire a surrounding sound, and a speech 
recognition means to recognize voice from the sound acquired in the 
sound-collecting means, Two or more kinds of phonograms presumed from the 
alphabetic character extracted in the image recognition means are given based 
on a word phonetic table. A pronunciation information generation means to 
generate the method of pronunciation, and the voice wave equivalent to 
pronunciation to each of two or more kinds of given phonograms, The voice 
wave of the voice recognized in each voice wave and speech recognition means 
which were generated in the pronunciation information generation means is 



compared, and the dictionary for speech recognition is equipped with a storage 
control means to memorize newly noting that it is the method of the 
pronunciation of an alphabetic character which extracted the nearest voice wave. 
[0187] Therefore, according to the character reader concerning this invention, 
the bad influence by incorrect recognition of the pronunciation which contains 
weak phonemes (for example, /s/of the initial of the word etc.) especially, change 
of the input phoneme under the effect of a surrounding noise, failure in voice 
section detection, etc. is inhibited, and the recognition precision at the time of 
registering a new word can be improved. Since the method of exact 
pronunciation can memorize in the dictionary for speech recognition by this, the 
recognition precision at the time of recognizing the word registered as a new 
word improves. 

[0188] Moreover, the character reader concerning this invention is equipped with 
a word information storage means by which word information including the 
phonogram and word attribute of a word and this word was memorized as a 
word attribute table, with the method of the pronunciation of the alphabetic 
character which a storage control means memorizes newly, and this alphabetic 
character, makes a word attribute correspond and memorizes it in the dictionary 
for speech recognition. 

[0189] Therefore, while according to the character reader concerning this 



invention it becomes unnecessary for a user to input the word attribute 
information which is needed when applying the syntax rule, a dialogue regulation, 
etc. to the inputted voice and the voice to output and convenience improves, 
when a user does not know attribute information, un-arranging [ that attribute 
information was not able to be inputted ] is improved. 

[0190] Moreover, the image pick-up process at which the character recognition 
approach concerning this invention picturizes a photographic subject, The image 
recognition process which extracts the image of a predetermined pattern from 
the image picturized in the image pick-up process, The sound-collecting process 
which acquires a surrounding sound, and the speech recognition process which 
recognizes voice from the sound acquired in the sound-collecting process, Two 
or more kinds of phonograms presumed from the alphabetic character extracted^ 
in the image recognition process are given based on the word phonetic table on 
which correspondence relation with the phonogram of a word and this word was 
memorized. The pronunciation information generation process which generates 
the method of pronunciation, and the voice wave equivalent to pronunciation to 
each of two or more kinds of given phonograms, The voice wave of the voice 
recognized in each voice wave and speech recognition process which were 
generated in the pronunciation information generation process is compared. The 
dictionary for speech recognition which memorized correspondence relation with 



the method of the pronunciation of a word and this word is equipped with the 
storage control process memorized newly noting that it is the method of the 
pronunciation of an alphabetic character which extracted the nearest voice wave. 
[0191] Therefore, according to the character recognition approach concerning 
this invention, the bad influence by incorrect recognition of the pronunciation 
which contains weak phonemes (for example, /s/of the initial of the word etc.) 
especially, change of the input phoneme under the effect of a surrounding noise^ 
failure in voice section detection, etc. is inhibited, and the recognition precision 
at the time of registering a new word can be improved. Since the method of 
exact pronunciation can memorize in the dictionary for speech recognition by 
this, the recognition precision at the time of recognizing the word registered as a 
new word improves. 

[0192] Moreover, according to the character recognition approach concerning 
this invention, it has a word information storage means by which word 
information including the phonogram and word attribute of a word and this word 
was memorized as a word attribute table, and with the method of the 
pronunciation of the alphabetic character which a storage control means 
memorizes newly, and this alphabetic character, a word attribute is made to 
correspond and it memorizes in the dictionary for speech recognition. 
[0193] Therefore, while according to the character recognition approach 



concerning this invention it becomes unnecessary for a user to input the word 
attribute information which is needed when applying the syntax rule, a dialogue 
regulation, etc. to the inputted voice and the voice to output and convenience 
improves, when a user does not know attribute information, un-arranging [ that 
attribute information was not able to be inputted ] is improved. 
[0194] Furthermore, the image pick-up processing whose control program 
concerning this invention picturizes a photographic subject, The image 
recognition processing which extracts the image of a predetermined pattern from 
the image picturized by image pick-up processing, The sound-collecting^ 
processing which acquires a surrounding sound, and the speech recognition 
processing which recognizes voice from the sound acquired by sound-collecting 
processing, Two or more kinds of phonograms presumed from the alphabetic 
character extracted by image recognition processing are given based on the 
word phonetic table on which correspondence relation with the phonogram of a 
word arid this word was memorized. The pronunciation information generation 
processing which generates the method of pronunciation, and the voice wave 
equivalent to pronunciation to each of two or more kinds of given phonograms, 
The voice wave of the voice recognized in each voice wave and speech 
recognition processing which were generated by pronunciation information 
generation processing is compared. Robot equipment is made to perform 



storage processing newly memorized in the dictionary for speech recognition 
which memorized correspondence relation with the method of the pronunciation 
of a word and this word noting that it is the method of the pronunciation of an 
alphabetic character which extracted the nearest voice wave. 
[0195] Therefore, according to the control program concerning this invention, the 
bad influence by incorrect recognition of the pronunciation which contains weak 
phonemes (for example, /s/of the initial of the word etc.) especially, change of 
the input phoneme under the effect of a surrounding noise, failure in voice 
section detection, etc. is inhibited, and the recognition precision of robot 
equipment at the time of registering a new word improves. Since the method of 
exact pronunciation can memorize in the dictionary for speech recognition by^- 
this, the recognition precision at the time of recognizing the word registered as a 
new word improves. 

[0196] Moreover, by recording an above-mentioned control program on a record 
medium, and offering it, reading of this record medium is possible and the 
recognition precision at the time of registering a new word improves to the 
electronic equipment which has a function as a voice recognition unit equipped 
with an image recognition means and a speech recognition means. Since the 
method of exact pronunciation is memorizable by this, the recognition precision 
at the time of recognizing the word registered as a new word improves. 



DESCRIPTION OF DRAWINGS 



[Brief Description of the Drawings] 

[Drawing 1] It is the external view showing the appearance of the robot 
equipment shown as an example of 1 configuration of this invention. 
[Drawing 2] It is the block diagram showing the configuration of the robot 
equipment shown as an example of 1 configuration of this invention. 
[Drawing 3] It is the block diagram showing the configuration of the image 
speech recognition section in the robot equipment shown as an example of 1 
configuration of this invention. 

[Drawing 4] It is drawing explaining the dictionary for speech recognition of the 
robot equipment shown as an example of 1 configuration of this invention. 
[Drawing 5] It is drawing explaining the word reading attribute table of the robot_ 
equipment shown as an example of 1 configuration of this invention. 
[Drawing 6] It is drawing explaining the alphabetic character reading table of the 
robot equipment shown as an example of 1 configuration of this invention. 
[Drawing 7] The robot equipment shown as an example of 1 configuration of this 
invention is a flow chart explaining the processing which registers a new word 
into the dictionary for speech recognition. 

[Drawing 8] It is drawing explaining the dictionary for recognition of the robot 



equipment shown as an example of 1 configuration of this invention for new 
words. 

[Drawing 9] It is a flow chart explaining the processing which generates the 
method (reading) of the pronunciation of the character string which the robot 
equipment shown as an example of 1 configuration of this invention has 
recognized. 

[Drawing 10] It is the block diagram showing the software configuration of the 
control program of the robot equipment shown as an example of 1 configuration 
of this invention. 

[Drawing 11] It is the block diagram showing the configuration of a middleware 
layer among the control programs of the robot equipment shown as an example 
of 1 configuration of this invention. 

[Drawing 12] It is the block diagram showing the configuration of an application 
layer among the control programs of the robot equipment shown as an example 
of 1 configuration of this invention. 

[Drawing 13] It is the block diagram showing the configuration of a behavioral 
model library among the control programs of the robot equipment shown as an 
example of 1 configuration of this invention. 

[Drawing 14] It is a mimetic diagram explaining the finite stochastic automaton 
which is an algorithm for opting for action of the robot equipment shown as an 



example of 1 configuration of this invention. 

[Drawing 15] It is drawing showing the state-transition conditions for opting for — 
action of the robot equipment shown as an example of 1 configuration of this 
invention. 

[Drawing 16] It is an external view explaining the appearance seen from the front 
of the humanoid robot equipment shown as an example of 1 configuration of this 
invention. 

[Drawing 17] It is an external view explaining the appearance seen from the back 
of the humanoid robot equipment shown as an example of 1 configuration of this 
invention. 

[Drawing 18] It is drawing showing typically the degree-of-freedom configuration 
model of the humanoid robot equipment shown as an example of 1 configuration 
of this invention. 

[Drawing 19] It is drawing explaining the control-system configuration of the 
humanoid robot equipment shown as an example of 1 configuration of this 
invention. 

[Drawing 20] Drawing 20 (a) is the mimetic diagram showing the conventional — 
speech recognition approach which applied the garbage model which makes a 
"phoneme" a base unit, and drawing 20 (b) is the mimetic diagram showing the 
conventional speech recognition approach which applied the garbage model 



which makes "kana" a base unit. 
[Description of Notations] 

1 Robot Equipment, 2 Idiosoma Unit, 3A, 3B and 3C, 3D Leg Unit, 4 A head unit, 
5 A tail section unit, 10 CPU, 11 DRAM, 12 A flash ROM, 13 PC card interface 
circuitry, 14 A digital disposal circuit, 15 An internal bus, 16 Control section, 17 A 
dc-battery, 18 angular-velocity sensor, 19 An acceleration sensor, 20 CCD 
camera, 21 A touch sensor, 22 A distance robot, 23 A microphone, 24 
Loudspeaker, 251 -25n An actuator, 261 -26n Potentiometer, 271 -27n A hub, 28 
memory cards, 100 Image speech recognition section, 1 10 The dialogue 
Management Department, a 111 dialogue regulation table, 120 Speech 
recognition section, The dictionary for 121 speech recognition, 122 A sound 
model, 123 Language model, The 124 sonagraphy sections, 130 The output ~~ 
generation section, 140 Image-processing character recognition section, 141 A 
character-pattern database, 150 The pronunciation information generation 
section, 151 A word reading attribute table, 152 An alphabetic character reading 
table, 153 A reading grant table, 200 humanoid-robot equipment 



